Amino acid dipepetide frequency for Lactococcus phage 28201

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.463AlaAla: 4.463 ± 0.794
0.093AlaCys: 0.093 ± 0.096
3.813AlaAsp: 3.813 ± 0.664
3.813AlaGlu: 3.813 ± 0.528
1.674AlaPhe: 1.674 ± 0.376
4.835AlaGly: 4.835 ± 0.753
1.302AlaHis: 1.302 ± 0.373
4.742AlaIle: 4.742 ± 0.859
4.742AlaLys: 4.742 ± 0.622
6.323AlaLeu: 6.323 ± 1.278
2.139AlaMet: 2.139 ± 0.503
3.627AlaAsn: 3.627 ± 0.635
2.418AlaPro: 2.418 ± 0.506
3.255AlaGln: 3.255 ± 0.399
2.139AlaArg: 2.139 ± 0.453
3.348AlaSer: 3.348 ± 0.552
4.184AlaThr: 4.184 ± 0.81
4.277AlaVal: 4.277 ± 0.685
1.116AlaTrp: 1.116 ± 0.283
2.418AlaTyr: 2.418 ± 0.524
0.0AlaXaa: 0.0 ± 0.0
Cys
0.093CysAla: 0.093 ± 0.08
0.093CysCys: 0.093 ± 0.098
0.465CysAsp: 0.465 ± 0.205
0.558CysGlu: 0.558 ± 0.223
0.465CysPhe: 0.465 ± 0.188
0.744CysGly: 0.744 ± 0.353
0.279CysHis: 0.279 ± 0.165
0.279CysIle: 0.279 ± 0.141
0.186CysLys: 0.186 ± 0.125
0.279CysLeu: 0.279 ± 0.146
0.0CysMet: 0.0 ± 0.0
0.093CysAsn: 0.093 ± 0.085
0.279CysPro: 0.279 ± 0.165
0.0CysGln: 0.0 ± 0.0
0.372CysArg: 0.372 ± 0.158
0.465CysSer: 0.465 ± 0.21
0.279CysThr: 0.279 ± 0.195
0.093CysVal: 0.093 ± 0.088
0.093CysTrp: 0.093 ± 0.112
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
2.511AspAla: 2.511 ± 0.686
0.558AspCys: 0.558 ± 0.223
4.184AspAsp: 4.184 ± 0.632
5.021AspGlu: 5.021 ± 0.754
3.813AspPhe: 3.813 ± 0.729
6.044AspGly: 6.044 ± 1.393
0.465AspHis: 0.465 ± 0.189
4.092AspIle: 4.092 ± 0.589
6.416AspLys: 6.416 ± 0.885
4.092AspLeu: 4.092 ± 0.674
1.953AspMet: 1.953 ± 0.47
4.556AspAsn: 4.556 ± 0.606
1.023AspPro: 1.023 ± 0.397
1.023AspGln: 1.023 ± 0.347
2.232AspArg: 2.232 ± 0.436
4.277AspSer: 4.277 ± 0.631
3.534AspThr: 3.534 ± 0.635
4.092AspVal: 4.092 ± 0.648
0.744AspTrp: 0.744 ± 0.266
3.162AspTyr: 3.162 ± 0.619
0.0AspXaa: 0.0 ± 0.0
Glu
4.184GluAla: 4.184 ± 0.84
0.558GluCys: 0.558 ± 0.234
3.255GluAsp: 3.255 ± 0.498
6.137GluGlu: 6.137 ± 1.258
3.534GluPhe: 3.534 ± 0.48
2.511GluGly: 2.511 ± 0.438
1.302GluHis: 1.302 ± 0.441
6.416GluIle: 6.416 ± 0.751
5.765GluLys: 5.765 ± 0.991
7.253GluLeu: 7.253 ± 0.902
2.046GluMet: 2.046 ± 0.488
3.627GluAsn: 3.627 ± 0.766
1.395GluPro: 1.395 ± 0.389
3.72GluGln: 3.72 ± 0.676
3.255GluArg: 3.255 ± 0.719
3.999GluSer: 3.999 ± 0.678
4.835GluThr: 4.835 ± 0.818
3.999GluVal: 3.999 ± 0.737
1.023GluTrp: 1.023 ± 0.402
2.511GluTyr: 2.511 ± 0.463
0.0GluXaa: 0.0 ± 0.0
Phe
2.604PheAla: 2.604 ± 0.481
0.186PheCys: 0.186 ± 0.137
2.79PheAsp: 2.79 ± 0.589
4.277PheGlu: 4.277 ± 0.585
1.674PhePhe: 1.674 ± 0.421
2.418PheGly: 2.418 ± 0.394
0.372PheHis: 0.372 ± 0.153
3.534PheIle: 3.534 ± 0.527
3.255PheLys: 3.255 ± 0.36
2.139PheLeu: 2.139 ± 0.481
1.488PheMet: 1.488 ± 0.422
2.79PheAsn: 2.79 ± 0.428
1.395PhePro: 1.395 ± 0.336
1.581PheGln: 1.581 ± 0.388
1.302PheArg: 1.302 ± 0.353
3.069PheSer: 3.069 ± 0.491
2.604PheThr: 2.604 ± 0.412
2.511PheVal: 2.511 ± 0.58
0.651PheTrp: 0.651 ± 0.257
2.046PheTyr: 2.046 ± 0.463
0.0PheXaa: 0.0 ± 0.0
Gly
2.604GlyAla: 2.604 ± 0.763
0.279GlyCys: 0.279 ± 0.189
3.906GlyAsp: 3.906 ± 0.645
3.534GlyGlu: 3.534 ± 0.556
3.162GlyPhe: 3.162 ± 0.524
4.928GlyGly: 4.928 ± 0.667
1.023GlyHis: 1.023 ± 0.267
6.788GlyIle: 6.788 ± 0.683
6.044GlyLys: 6.044 ± 0.877
4.742GlyLeu: 4.742 ± 0.836
1.302GlyMet: 1.302 ± 0.329
3.906GlyAsn: 3.906 ± 0.636
0.837GlyPro: 0.837 ± 0.296
2.604GlyGln: 2.604 ± 0.54
2.046GlyArg: 2.046 ± 0.344
4.928GlySer: 4.928 ± 0.849
5.951GlyThr: 5.951 ± 1.072
3.999GlyVal: 3.999 ± 0.609
0.744GlyTrp: 0.744 ± 0.27
2.604GlyTyr: 2.604 ± 0.407
0.0GlyXaa: 0.0 ± 0.0
His
0.651HisAla: 0.651 ± 0.274
0.186HisCys: 0.186 ± 0.134
0.93HisAsp: 0.93 ± 0.3
1.488HisGlu: 1.488 ± 0.393
0.465HisPhe: 0.465 ± 0.217
0.93HisGly: 0.93 ± 0.354
0.372HisHis: 0.372 ± 0.184
1.023HisIle: 1.023 ± 0.332
1.116HisLys: 1.116 ± 0.276
1.023HisLeu: 1.023 ± 0.272
0.186HisMet: 0.186 ± 0.155
1.023HisAsn: 1.023 ± 0.278
0.279HisPro: 0.279 ± 0.195
0.372HisGln: 0.372 ± 0.167
0.465HisArg: 0.465 ± 0.18
1.023HisSer: 1.023 ± 0.401
0.651HisThr: 0.651 ± 0.205
0.744HisVal: 0.744 ± 0.327
0.093HisTrp: 0.093 ± 0.067
0.744HisTyr: 0.744 ± 0.387
0.0HisXaa: 0.0 ± 0.0
Ile
4.742IleAla: 4.742 ± 0.589
0.372IleCys: 0.372 ± 0.194
5.765IleAsp: 5.765 ± 0.614
4.37IleGlu: 4.37 ± 0.623
2.418IlePhe: 2.418 ± 0.587
4.37IleGly: 4.37 ± 0.61
1.209IleHis: 1.209 ± 0.544
5.393IleIle: 5.393 ± 1.514
7.346IleLys: 7.346 ± 0.843
5.207IleLeu: 5.207 ± 0.669
1.302IleMet: 1.302 ± 0.423
4.928IleAsn: 4.928 ± 0.769
2.883IlePro: 2.883 ± 0.615
3.255IleGln: 3.255 ± 0.632
3.162IleArg: 3.162 ± 0.524
6.509IleSer: 6.509 ± 0.734
4.463IleThr: 4.463 ± 0.888
4.184IleVal: 4.184 ± 0.714
0.558IleTrp: 0.558 ± 0.236
1.395IleTyr: 1.395 ± 0.47
0.0IleXaa: 0.0 ± 0.0
Lys
5.765LysAla: 5.765 ± 1.177
0.558LysCys: 0.558 ± 0.257
5.114LysAsp: 5.114 ± 0.758
6.695LysGlu: 6.695 ± 1.045
3.72LysPhe: 3.72 ± 0.752
5.114LysGly: 5.114 ± 0.614
0.93LysHis: 0.93 ± 0.273
5.672LysIle: 5.672 ± 0.732
6.974LysLys: 6.974 ± 1.251
7.625LysLeu: 7.625 ± 0.972
2.511LysMet: 2.511 ± 0.463
4.928LysAsn: 4.928 ± 0.79
2.139LysPro: 2.139 ± 0.439
3.534LysGln: 3.534 ± 0.652
2.976LysArg: 2.976 ± 0.525
5.765LysSer: 5.765 ± 0.916
5.021LysThr: 5.021 ± 0.805
3.906LysVal: 3.906 ± 0.542
1.209LysTrp: 1.209 ± 0.391
2.79LysTyr: 2.79 ± 0.429
0.0LysXaa: 0.0 ± 0.0
Leu
5.393LeuAla: 5.393 ± 0.785
0.186LeuCys: 0.186 ± 0.143
4.37LeuAsp: 4.37 ± 0.547
5.114LeuGlu: 5.114 ± 0.859
3.441LeuPhe: 3.441 ± 0.576
4.649LeuGly: 4.649 ± 0.85
0.465LeuHis: 0.465 ± 0.208
4.742LeuIle: 4.742 ± 0.856
5.486LeuLys: 5.486 ± 0.903
4.835LeuLeu: 4.835 ± 0.993
2.139LeuMet: 2.139 ± 0.523
4.092LeuAsn: 4.092 ± 0.714
3.813LeuPro: 3.813 ± 1.078
2.883LeuGln: 2.883 ± 0.669
3.441LeuArg: 3.441 ± 0.602
6.881LeuSer: 6.881 ± 0.786
5.951LeuThr: 5.951 ± 0.719
3.906LeuVal: 3.906 ± 0.724
1.302LeuTrp: 1.302 ± 0.411
2.139LeuTyr: 2.139 ± 0.575
0.0LeuXaa: 0.0 ± 0.0
Met
1.86MetAla: 1.86 ± 0.555
0.0MetCys: 0.0 ± 0.0
1.581MetAsp: 1.581 ± 0.43
1.767MetGlu: 1.767 ± 0.464
0.837MetPhe: 0.837 ± 0.336
1.953MetGly: 1.953 ± 0.388
0.186MetHis: 0.186 ± 0.143
1.674MetIle: 1.674 ± 0.588
2.046MetLys: 2.046 ± 0.544
1.395MetLeu: 1.395 ± 0.375
0.837MetMet: 0.837 ± 0.303
1.581MetAsn: 1.581 ± 0.5
0.744MetPro: 0.744 ± 0.265
1.488MetGln: 1.488 ± 0.424
0.837MetArg: 0.837 ± 0.284
1.86MetSer: 1.86 ± 0.513
2.511MetThr: 2.511 ± 0.421
1.116MetVal: 1.116 ± 0.3
0.279MetTrp: 0.279 ± 0.127
0.465MetTyr: 0.465 ± 0.198
0.0MetXaa: 0.0 ± 0.0
Asn
4.184AsnAla: 4.184 ± 0.703
0.372AsnCys: 0.372 ± 0.2
3.627AsnAsp: 3.627 ± 0.571
3.255AsnGlu: 3.255 ± 0.607
3.255AsnPhe: 3.255 ± 0.623
5.951AsnGly: 5.951 ± 1.062
0.837AsnHis: 0.837 ± 0.396
5.207AsnIle: 5.207 ± 0.596
4.37AsnLys: 4.37 ± 0.602
3.534AsnLeu: 3.534 ± 0.448
0.837AsnMet: 0.837 ± 0.348
5.3AsnAsn: 5.3 ± 0.808
2.697AsnPro: 2.697 ± 0.543
2.232AsnGln: 2.232 ± 0.491
2.325AsnArg: 2.325 ± 0.463
3.813AsnSer: 3.813 ± 0.605
2.697AsnThr: 2.697 ± 0.536
3.069AsnVal: 3.069 ± 0.562
0.744AsnTrp: 0.744 ± 0.255
2.418AsnTyr: 2.418 ± 0.483
0.0AsnXaa: 0.0 ± 0.0
Pro
1.767ProAla: 1.767 ± 0.512
0.0ProCys: 0.0 ± 0.0
1.767ProAsp: 1.767 ± 0.47
1.488ProGlu: 1.488 ± 0.326
1.023ProPhe: 1.023 ± 0.244
0.93ProGly: 0.93 ± 0.297
0.651ProHis: 0.651 ± 0.209
2.046ProIle: 2.046 ± 0.375
2.325ProLys: 2.325 ± 0.444
2.046ProLeu: 2.046 ± 0.503
0.558ProMet: 0.558 ± 0.232
1.488ProAsn: 1.488 ± 0.331
0.93ProPro: 0.93 ± 0.29
2.325ProGln: 2.325 ± 0.602
1.209ProArg: 1.209 ± 0.439
2.79ProSer: 2.79 ± 0.459
2.604ProThr: 2.604 ± 0.503
1.953ProVal: 1.953 ± 0.345
0.372ProTrp: 0.372 ± 0.145
1.023ProTyr: 1.023 ± 0.276
0.0ProXaa: 0.0 ± 0.0
Gln
3.627GlnAla: 3.627 ± 0.822
0.186GlnCys: 0.186 ± 0.109
2.046GlnAsp: 2.046 ± 0.433
2.79GlnGlu: 2.79 ± 0.598
1.581GlnPhe: 1.581 ± 0.376
2.697GlnGly: 2.697 ± 0.39
0.558GlnHis: 0.558 ± 0.222
3.162GlnIle: 3.162 ± 0.616
3.534GlnLys: 3.534 ± 0.472
3.72GlnLeu: 3.72 ± 0.772
1.674GlnMet: 1.674 ± 0.434
2.232GlnAsn: 2.232 ± 0.472
0.93GlnPro: 0.93 ± 0.275
2.046GlnGln: 2.046 ± 0.442
1.581GlnArg: 1.581 ± 0.52
2.325GlnSer: 2.325 ± 0.37
2.232GlnThr: 2.232 ± 0.384
2.325GlnVal: 2.325 ± 0.397
0.372GlnTrp: 0.372 ± 0.186
1.116GlnTyr: 1.116 ± 0.3
0.0GlnXaa: 0.0 ± 0.0
Arg
2.325ArgAla: 2.325 ± 0.39
0.465ArgCys: 0.465 ± 0.261
2.697ArgAsp: 2.697 ± 0.489
3.348ArgGlu: 3.348 ± 0.72
1.488ArgPhe: 1.488 ± 0.331
2.046ArgGly: 2.046 ± 0.509
0.558ArgHis: 0.558 ± 0.294
2.604ArgIle: 2.604 ± 0.454
3.441ArgLys: 3.441 ± 0.414
3.162ArgLeu: 3.162 ± 0.581
0.93ArgMet: 0.93 ± 0.287
1.953ArgAsn: 1.953 ± 0.417
1.023ArgPro: 1.023 ± 0.338
1.581ArgGln: 1.581 ± 0.388
2.046ArgArg: 2.046 ± 0.44
1.488ArgSer: 1.488 ± 0.318
1.953ArgThr: 1.953 ± 0.463
1.86ArgVal: 1.86 ± 0.462
0.837ArgTrp: 0.837 ± 0.284
1.767ArgTyr: 1.767 ± 0.363
0.0ArgXaa: 0.0 ± 0.0
Ser
3.999SerAla: 3.999 ± 0.682
0.186SerCys: 0.186 ± 0.132
5.765SerAsp: 5.765 ± 0.753
4.37SerGlu: 4.37 ± 0.668
3.441SerPhe: 3.441 ± 0.532
5.579SerGly: 5.579 ± 0.747
1.302SerHis: 1.302 ± 0.367
4.463SerIle: 4.463 ± 0.734
6.23SerLys: 6.23 ± 0.792
4.277SerLeu: 4.277 ± 0.591
1.209SerMet: 1.209 ± 0.386
4.649SerAsn: 4.649 ± 0.634
1.581SerPro: 1.581 ± 0.356
1.488SerGln: 1.488 ± 0.382
2.511SerArg: 2.511 ± 0.484
5.207SerSer: 5.207 ± 0.795
3.999SerThr: 3.999 ± 0.582
4.463SerVal: 4.463 ± 0.679
1.116SerTrp: 1.116 ± 0.338
3.627SerTyr: 3.627 ± 0.487
0.0SerXaa: 0.0 ± 0.0
Thr
5.486ThrAla: 5.486 ± 0.925
0.186ThrCys: 0.186 ± 0.129
3.72ThrAsp: 3.72 ± 0.705
4.742ThrGlu: 4.742 ± 0.676
2.139ThrPhe: 2.139 ± 0.537
5.3ThrGly: 5.3 ± 0.637
1.209ThrHis: 1.209 ± 0.349
4.37ThrIle: 4.37 ± 0.642
4.742ThrLys: 4.742 ± 0.661
5.486ThrLeu: 5.486 ± 0.757
1.116ThrMet: 1.116 ± 0.376
4.742ThrAsn: 4.742 ± 0.985
2.697ThrPro: 2.697 ± 0.462
2.976ThrGln: 2.976 ± 0.459
2.418ThrArg: 2.418 ± 0.458
3.441ThrSer: 3.441 ± 0.637
5.951ThrThr: 5.951 ± 1.358
5.207ThrVal: 5.207 ± 1.261
1.023ThrTrp: 1.023 ± 0.355
2.325ThrTyr: 2.325 ± 0.638
0.0ThrXaa: 0.0 ± 0.0
Val
4.742ValAla: 4.742 ± 0.721
0.093ValCys: 0.093 ± 0.08
4.649ValAsp: 4.649 ± 0.515
4.928ValGlu: 4.928 ± 0.69
2.046ValPhe: 2.046 ± 0.358
3.069ValGly: 3.069 ± 0.511
0.372ValHis: 0.372 ± 0.164
4.277ValIle: 4.277 ± 0.5
4.742ValLys: 4.742 ± 0.794
3.069ValLeu: 3.069 ± 0.624
1.581ValMet: 1.581 ± 0.389
3.255ValAsn: 3.255 ± 0.675
1.395ValPro: 1.395 ± 0.475
2.604ValGln: 2.604 ± 0.637
1.488ValArg: 1.488 ± 0.43
4.277ValSer: 4.277 ± 0.62
5.3ValThr: 5.3 ± 0.64
3.627ValVal: 3.627 ± 0.745
0.837ValTrp: 0.837 ± 0.256
1.581ValTyr: 1.581 ± 0.327
0.0ValXaa: 0.0 ± 0.0
Trp
1.209TrpAla: 1.209 ± 0.392
0.093TrpCys: 0.093 ± 0.093
0.93TrpAsp: 0.93 ± 0.3
0.837TrpGlu: 0.837 ± 0.271
0.558TrpPhe: 0.558 ± 0.203
0.558TrpGly: 0.558 ± 0.198
0.093TrpHis: 0.093 ± 0.089
0.93TrpIle: 0.93 ± 0.208
1.395TrpLys: 1.395 ± 0.317
1.209TrpLeu: 1.209 ± 0.373
0.372TrpMet: 0.372 ± 0.206
0.744TrpAsn: 0.744 ± 0.227
0.093TrpPro: 0.093 ± 0.103
0.558TrpGln: 0.558 ± 0.228
0.558TrpArg: 0.558 ± 0.213
0.837TrpSer: 0.837 ± 0.299
1.302TrpThr: 1.302 ± 0.663
0.744TrpVal: 0.744 ± 0.202
0.372TrpTrp: 0.372 ± 0.217
0.744TrpTyr: 0.744 ± 0.325
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.511TyrAla: 2.511 ± 0.537
0.465TyrCys: 0.465 ± 0.279
2.418TyrAsp: 2.418 ± 0.398
2.79TyrGlu: 2.79 ± 0.564
1.953TyrPhe: 1.953 ± 0.339
1.302TyrGly: 1.302 ± 0.439
0.279TyrHis: 0.279 ± 0.181
2.418TyrIle: 2.418 ± 0.5
2.79TyrLys: 2.79 ± 0.542
3.906TyrLeu: 3.906 ± 0.729
0.744TyrMet: 0.744 ± 0.254
1.209TyrAsn: 1.209 ± 0.309
0.744TyrPro: 0.744 ± 0.252
1.209TyrGln: 1.209 ± 0.281
1.209TyrArg: 1.209 ± 0.321
3.162TyrSer: 3.162 ± 0.546
3.255TyrThr: 3.255 ± 0.593
1.86TyrVal: 1.86 ± 0.507
0.651TyrTrp: 0.651 ± 0.237
1.674TyrTyr: 1.674 ± 0.395
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 51 proteins (10755 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski