Amino acid dipeptide frequency for Lactococcus phage 98204

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
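
As an illustration of what a dipeptide frequency is, here is a minimal Python sketch. It assumes overlapping pairs counted within each protein (never across protein boundaries) and normalization by the total number of pairs; the page does not state how Proteome-pI counts them, so both choices are assumptions. The function names and the toy sequences are hypothetical, and one-letter codes are used for brevity where the table uses three-letter codes.

```python
from collections import Counter


def dipeptide_counts(proteins):
    """Count overlapping dipeptides inside each protein sequence.

    Assumption: pairs never span two proteins.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    return counts


def dipeptide_frequencies(proteins):
    """Return each observed dipeptide's share of all counted dipeptides."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, one-letter amino acid codes).
toy_proteins = ["MKKLA", "MKA"]
toy_freqs = dipeptide_frequencies(toy_proteins)
```

With the toy input there are six pairs in total (MK, KK, KL, LA from the first sequence and MK, KA from the second), so `toy_freqs["MK"]` is one third.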

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
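
The arithmetic behind the uniform expectation and the per mille unit can be sketched in Python as follows. The names are hypothetical, and using exactly 1/400 as the over/under threshold is an assumption: the page does not say which cutoff drives the colouring, and with Xaa included the expectation would be 1000/441 ≈ 2.27‰ instead.

```python
# Uniform expectation over the 20 standard amino acids.
N_PAIRS = 20 * 20                    # 400 possible dipeptides
EXPECTED_PERMILLE = 1000 / N_PAIRS   # 2.5 per mille, i.e. 0.25%


def permille_to_fraction(value):
    """Convert a table value in per mille to a plain fraction."""
    return value * 1e-3


def classify(value_permille, expected=EXPECTED_PERMILLE):
    """Label a value as over- or underrepresented relative to `expected`."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "expected"


# Two values taken from the table: AlaAla (3.907) and AlaCys (0.665).
ala_ala = classify(3.907)
ala_cys = classify(0.665)
```

Under this assumed threshold AlaAla is classified as overrepresented and AlaCys as underrepresented.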

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.907 ± 0.805
AlaCys: 0.665 ± 0.255
AlaAsp: 3.159 ± 0.559
AlaGlu: 3.657 ± 0.774
AlaPhe: 3.491 ± 0.509
AlaGly: 4.239 ± 0.712
AlaHis: 0.831 ± 0.277
AlaIle: 4.073 ± 0.851
AlaLys: 5.901 ± 0.994
AlaLeu: 6.234 ± 0.701
AlaMet: 1.579 ± 0.319
AlaAsn: 5.236 ± 0.59
AlaPro: 1.33 ± 0.284
AlaGln: 2.743 ± 0.645
AlaArg: 1.829 ± 0.555
AlaSer: 3.657 ± 0.66
AlaThr: 3.907 ± 0.652
AlaVal: 3.823 ± 0.6
AlaTrp: 1.745 ± 0.434
AlaTyr: 2.161 ± 0.394
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.083 ± 0.084
CysCys: 0.0 ± 0.0
CysAsp: 0.831 ± 0.294
CysGlu: 0.748 ± 0.235
CysPhe: 0.249 ± 0.167
CysGly: 0.748 ± 0.38
CysHis: 0.166 ± 0.122
CysIle: 0.416 ± 0.147
CysLys: 0.416 ± 0.218
CysLeu: 0.083 ± 0.09
CysMet: 0.0 ± 0.0
CysAsn: 0.416 ± 0.22
CysPro: 0.249 ± 0.153
CysGln: 0.0 ± 0.0
CysArg: 0.249 ± 0.128
CysSer: 0.582 ± 0.255
CysThr: 0.249 ± 0.166
CysVal: 0.249 ± 0.117
CysTrp: 0.083 ± 0.083
CysTyr: 0.083 ± 0.072
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.322 ± 0.648
AspCys: 0.499 ± 0.24
AspAsp: 4.073 ± 0.686
AspGlu: 4.821 ± 0.721
AspPhe: 3.325 ± 0.697
AspGly: 4.488 ± 0.797
AspHis: 0.665 ± 0.283
AspIle: 4.904 ± 0.743
AspLys: 4.405 ± 0.511
AspLeu: 4.821 ± 0.585
AspMet: 1.247 ± 0.423
AspAsn: 3.325 ± 0.457
AspPro: 1.247 ± 0.448
AspGln: 1.081 ± 0.315
AspArg: 2.244 ± 0.321
AspSer: 4.239 ± 0.641
AspThr: 3.657 ± 0.434
AspVal: 3.99 ± 0.551
AspTrp: 1.081 ± 0.306
AspTyr: 3.075 ± 0.47
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.159 ± 0.538
GluCys: 0.416 ± 0.179
GluAsp: 2.078 ± 0.371
GluGlu: 5.569 ± 1.053
GluPhe: 3.574 ± 0.709
GluGly: 2.909 ± 0.493
GluHis: 1.164 ± 0.388
GluIle: 3.99 ± 0.612
GluLys: 6.733 ± 1.118
GluLeu: 6.816 ± 1.145
GluMet: 2.494 ± 0.466
GluAsn: 3.242 ± 0.642
GluPro: 1.662 ± 0.469
GluGln: 3.408 ± 0.516
GluArg: 3.075 ± 0.493
GluSer: 3.242 ± 0.494
GluThr: 4.322 ± 0.822
GluVal: 4.488 ± 0.63
GluTrp: 1.33 ± 0.366
GluTyr: 2.992 ± 0.581
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.41 ± 0.423
PheCys: 0.332 ± 0.159
PheAsp: 4.073 ± 0.499
PheGlu: 3.159 ± 0.579
PhePhe: 2.244 ± 0.377
PheGly: 2.743 ± 0.456
PheHis: 0.416 ± 0.197
PheIle: 3.408 ± 0.514
PheLys: 4.239 ± 0.685
PheLeu: 2.826 ± 0.48
PheMet: 1.496 ± 0.416
PheAsn: 2.577 ± 0.417
PhePro: 1.33 ± 0.417
PheGln: 1.995 ± 0.385
PheArg: 1.33 ± 0.408
PheSer: 3.823 ± 0.573
PheThr: 2.909 ± 0.599
PheVal: 1.662 ± 0.37
PheTrp: 0.249 ± 0.161
PheTyr: 1.912 ± 0.367
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.408 ± 0.721
GlyCys: 0.499 ± 0.173
GlyAsp: 3.408 ± 0.488
GlyGlu: 2.743 ± 0.492
GlyPhe: 3.075 ± 0.526
GlyGly: 5.32 ± 1.684
GlyHis: 0.665 ± 0.205
GlyIle: 5.153 ± 0.574
GlyLys: 6.649 ± 0.875
GlyLeu: 4.572 ± 1.126
GlyMet: 1.579 ± 0.416
GlyAsn: 3.574 ± 0.903
GlyPro: 0.748 ± 0.287
GlyGln: 2.66 ± 0.631
GlyArg: 3.242 ± 0.562
GlySer: 4.572 ± 0.758
GlyThr: 4.821 ± 0.549
GlyVal: 3.491 ± 0.636
GlyTrp: 1.164 ± 0.34
GlyTyr: 3.657 ± 0.577
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.247 ± 0.381
HisCys: 0.332 ± 0.186
HisAsp: 0.914 ± 0.328
HisGlu: 0.914 ± 0.283
HisPhe: 0.499 ± 0.202
HisGly: 1.081 ± 0.321
HisHis: 0.499 ± 0.274
HisIle: 0.914 ± 0.368
HisLys: 0.499 ± 0.178
HisLeu: 0.748 ± 0.336
HisMet: 0.249 ± 0.167
HisAsn: 0.997 ± 0.331
HisPro: 0.499 ± 0.18
HisGln: 0.582 ± 0.213
HisArg: 0.499 ± 0.186
HisSer: 1.164 ± 0.384
HisThr: 0.499 ± 0.205
HisVal: 0.748 ± 0.235
HisTrp: 0.249 ± 0.142
HisTyr: 0.748 ± 0.249
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.821 ± 0.642
IleCys: 0.332 ± 0.144
IleAsp: 4.738 ± 0.702
IleGlu: 5.735 ± 0.779
IlePhe: 2.41 ± 0.416
IleGly: 4.156 ± 0.889
IleHis: 1.662 ± 0.564
IleIle: 4.322 ± 0.761
IleLys: 6.982 ± 0.587
IleLeu: 4.572 ± 0.548
IleMet: 1.496 ± 0.331
IleAsn: 5.236 ± 0.799
IlePro: 2.494 ± 0.501
IleGln: 3.325 ± 0.437
IleArg: 2.161 ± 0.521
IleSer: 4.987 ± 0.641
IleThr: 4.572 ± 0.584
IleVal: 2.826 ± 0.636
IleTrp: 0.831 ± 0.21
IleTyr: 2.41 ± 0.527
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.314 ± 0.929
LysCys: 0.0 ± 0.0
LysAsp: 4.904 ± 0.631
LysGlu: 5.569 ± 0.815
LysPhe: 3.491 ± 0.501
LysGly: 6.151 ± 0.849
LysHis: 1.413 ± 0.506
LysIle: 6.483 ± 0.869
LysLys: 8.644 ± 1.108
LysLeu: 7.481 ± 1.047
LysMet: 2.41 ± 0.42
LysAsn: 6.982 ± 0.975
LysPro: 2.494 ± 0.404
LysGln: 4.572 ± 0.742
LysArg: 3.99 ± 0.674
LysSer: 3.574 ± 0.522
LysThr: 5.652 ± 0.84
LysVal: 3.99 ± 0.608
LysTrp: 1.164 ± 0.335
LysTyr: 3.242 ± 0.564
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.405 ± 0.522
LeuCys: 0.914 ± 0.37
LeuAsp: 4.987 ± 0.622
LeuGlu: 4.821 ± 0.963
LeuPhe: 3.242 ± 0.418
LeuGly: 3.408 ± 0.51
LeuHis: 0.416 ± 0.167
LeuIle: 5.236 ± 0.609
LeuLys: 6.982 ± 0.943
LeuLeu: 5.403 ± 0.656
LeuMet: 2.66 ± 0.439
LeuAsn: 5.735 ± 0.856
LeuPro: 2.992 ± 0.437
LeuGln: 3.907 ± 0.514
LeuArg: 1.912 ± 0.465
LeuSer: 6.4 ± 0.597
LeuThr: 5.07 ± 0.769
LeuVal: 3.408 ± 0.534
LeuTrp: 1.579 ± 0.61
LeuTyr: 2.244 ± 0.36
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.912 ± 0.352
MetCys: 0.249 ± 0.152
MetAsp: 1.164 ± 0.26
MetGlu: 1.745 ± 0.475
MetPhe: 0.416 ± 0.199
MetGly: 1.662 ± 0.402
MetHis: 0.166 ± 0.129
MetIle: 1.829 ± 0.327
MetLys: 2.41 ± 0.525
MetLeu: 1.745 ± 0.373
MetMet: 0.582 ± 0.231
MetAsn: 1.662 ± 0.39
MetPro: 0.499 ± 0.258
MetGln: 1.164 ± 0.333
MetArg: 1.33 ± 0.393
MetSer: 2.078 ± 0.493
MetThr: 2.494 ± 0.522
MetVal: 0.914 ± 0.278
MetTrp: 0.249 ± 0.156
MetTyr: 0.748 ± 0.297
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.655 ± 0.645
AsnCys: 0.332 ± 0.165
AsnAsp: 2.992 ± 0.482
AsnGlu: 3.159 ± 0.731
AsnPhe: 2.743 ± 0.408
AsnGly: 7.065 ± 1.27
AsnHis: 0.665 ± 0.246
AsnIle: 4.239 ± 0.655
AsnLys: 5.403 ± 0.648
AsnLeu: 5.236 ± 0.615
AsnMet: 1.579 ± 0.411
AsnAsn: 3.657 ± 0.574
AsnPro: 2.244 ± 0.506
AsnGln: 3.159 ± 0.691
AsnArg: 1.995 ± 0.28
AsnSer: 3.574 ± 0.576
AsnThr: 4.239 ± 0.707
AsnVal: 3.99 ± 0.528
AsnTrp: 1.33 ± 0.291
AsnTyr: 3.075 ± 0.645
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.914 ± 0.316
ProCys: 0.083 ± 0.09
ProAsp: 2.078 ± 0.442
ProGlu: 2.161 ± 0.444
ProPhe: 1.247 ± 0.294
ProGly: 1.413 ± 0.434
ProHis: 0.665 ± 0.244
ProIle: 2.743 ± 0.605
ProLys: 2.909 ± 0.437
ProLeu: 2.327 ± 0.433
ProMet: 0.249 ± 0.144
ProAsn: 1.164 ± 0.387
ProPro: 0.831 ± 0.219
ProGln: 1.247 ± 0.364
ProArg: 0.748 ± 0.218
ProSer: 1.912 ± 0.533
ProThr: 1.413 ± 0.288
ProVal: 2.078 ± 0.427
ProTrp: 0.166 ± 0.116
ProTyr: 0.831 ± 0.237
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.322 ± 0.681
GlnCys: 0.166 ± 0.103
GlnAsp: 1.829 ± 0.454
GlnGlu: 3.907 ± 0.503
GlnPhe: 2.161 ± 0.431
GlnGly: 2.66 ± 0.553
GlnHis: 0.499 ± 0.228
GlnIle: 2.992 ± 0.562
GlnLys: 2.909 ± 0.647
GlnLeu: 2.909 ± 0.749
GlnMet: 0.914 ± 0.302
GlnAsn: 2.577 ± 0.488
GlnPro: 0.582 ± 0.238
GlnGln: 2.327 ± 0.611
GlnArg: 1.413 ± 0.374
GlnSer: 3.075 ± 0.658
GlnThr: 2.826 ± 0.427
GlnVal: 3.408 ± 0.486
GlnTrp: 0.831 ± 0.365
GlnTyr: 1.579 ± 0.36
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.327 ± 0.428
ArgCys: 0.249 ± 0.151
ArgAsp: 2.327 ± 0.432
ArgGlu: 2.327 ± 0.471
ArgPhe: 1.662 ± 0.339
ArgGly: 1.496 ± 0.324
ArgHis: 0.332 ± 0.206
ArgIle: 2.577 ± 0.552
ArgLys: 4.904 ± 0.848
ArgLeu: 3.823 ± 0.727
ArgMet: 1.081 ± 0.372
ArgAsn: 2.327 ± 0.498
ArgPro: 1.081 ± 0.445
ArgGln: 0.914 ± 0.289
ArgArg: 1.413 ± 0.429
ArgSer: 1.662 ± 0.343
ArgThr: 2.244 ± 0.449
ArgVal: 1.995 ± 0.423
ArgTrp: 0.249 ± 0.152
ArgTyr: 1.662 ± 0.434
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.657 ± 0.727
SerCys: 0.083 ± 0.092
SerAsp: 5.486 ± 0.525
SerGlu: 4.572 ± 0.616
SerPhe: 3.408 ± 0.473
SerGly: 4.738 ± 0.924
SerHis: 0.582 ± 0.234
SerIle: 3.99 ± 0.843
SerLys: 3.74 ± 0.593
SerLeu: 4.572 ± 0.674
SerMet: 1.662 ± 0.418
SerAsn: 5.403 ± 0.968
SerPro: 0.831 ± 0.314
SerGln: 3.325 ± 0.542
SerArg: 2.244 ± 0.379
SerSer: 4.655 ± 0.88
SerThr: 3.574 ± 0.571
SerVal: 5.486 ± 0.743
SerTrp: 0.997 ± 0.289
SerTyr: 2.577 ± 0.461
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.153 ± 0.566
ThrCys: 0.332 ± 0.213
ThrAsp: 3.907 ± 0.524
ThrGlu: 3.99 ± 0.627
ThrPhe: 2.992 ± 0.535
ThrGly: 4.821 ± 0.636
ThrHis: 1.247 ± 0.383
ThrIle: 4.405 ± 0.586
ThrLys: 5.569 ± 0.552
ThrLeu: 4.073 ± 0.491
ThrMet: 1.247 ± 0.407
ThrAsn: 3.491 ± 0.576
ThrPro: 2.244 ± 0.391
ThrGln: 1.496 ± 0.306
ThrArg: 2.66 ± 0.485
ThrSer: 4.488 ± 0.674
ThrThr: 5.07 ± 0.629
ThrVal: 4.821 ± 0.863
ThrTrp: 0.914 ± 0.331
ThrTyr: 2.078 ± 0.349
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.491 ± 0.702
ValCys: 0.0 ± 0.0
ValAsp: 4.405 ± 0.925
ValGlu: 3.99 ± 0.754
ValPhe: 2.494 ± 0.494
ValGly: 2.909 ± 0.516
ValHis: 0.914 ± 0.362
ValIle: 4.156 ± 0.576
ValLys: 5.652 ± 0.774
ValLeu: 4.156 ± 0.699
ValMet: 1.413 ± 0.403
ValAsn: 4.322 ± 0.74
ValPro: 1.496 ± 0.346
ValGln: 2.327 ± 0.453
ValArg: 1.496 ± 0.433
ValSer: 4.073 ± 0.592
ValThr: 4.156 ± 0.726
ValVal: 3.408 ± 0.473
ValTrp: 0.665 ± 0.283
ValTyr: 2.161 ± 0.45
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.164 ± 0.31
TrpCys: 0.166 ± 0.119
TrpAsp: 1.33 ± 0.348
TrpGlu: 0.914 ± 0.37
TrpPhe: 0.499 ± 0.193
TrpGly: 0.665 ± 0.238
TrpHis: 0.332 ± 0.158
TrpIle: 1.413 ± 0.323
TrpLys: 1.413 ± 0.334
TrpLeu: 0.997 ± 0.339
TrpMet: 0.083 ± 0.082
TrpAsn: 1.413 ± 0.526
TrpPro: 0.166 ± 0.103
TrpGln: 1.164 ± 0.445
TrpArg: 1.081 ± 0.249
TrpSer: 0.748 ± 0.254
TrpThr: 0.997 ± 0.451
TrpVal: 0.582 ± 0.25
TrpTrp: 0.332 ± 0.178
TrpTyr: 0.748 ± 0.349
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.496 ± 0.384
TyrCys: 0.249 ± 0.123
TyrAsp: 2.826 ± 0.445
TyrGlu: 2.244 ± 0.515
TyrPhe: 1.912 ± 0.453
TyrGly: 2.327 ± 0.482
TyrHis: 0.582 ± 0.21
TyrIle: 2.826 ± 0.509
TyrLys: 3.159 ± 0.592
TyrLeu: 2.327 ± 0.556
TyrMet: 0.914 ± 0.285
TyrAsn: 1.995 ± 0.439
TyrPro: 2.161 ± 0.541
TyrGln: 2.494 ± 0.527
TyrArg: 1.745 ± 0.407
TyrSer: 3.075 ± 0.447
TyrThr: 2.327 ± 0.372
TyrVal: 2.327 ± 0.512
TyrTrp: 0.997 ± 0.269
TyrTyr: 1.912 ± 0.368
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 56 proteins (12032 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
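
A protein-level bootstrap of this kind could look roughly like the Python sketch below. It resamples whole proteins with replacement, recomputes the per mille frequency of one dipeptide for each replicate, and reports the standard deviation across replicates. Treating "×100" as 100 replicates and the ± value as a standard deviation are assumptions, as are the function names, the seed parameter, and the toy sequences (one-letter codes); the page does not describe the exact procedure.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Per mille frequency of `pair` among all overlapping dipeptides."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy proteome (hypothetical sequences).
toy_proteome = ["MKKLA", "MKA", "AAKK"]
kk_error = bootstrap_error(toy_proteome, "KK")
ww_error = bootstrap_error(toy_proteome, "WW")
```

A dipeptide that never occurs (such as `"WW"` in the toy proteome) has a frequency of 0 in every replicate, so its estimated error is 0.0, which matches the `0.0 ± 0.0` entries in the table.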

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski