Amino acid dipepetide frequency for Microbacterium phage Shee

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.283AlaAla: 11.283 ± 1.485
0.682AlaCys: 0.682 ± 0.281
4.846AlaAsp: 4.846 ± 0.781
6.436AlaGlu: 6.436 ± 0.765
2.877AlaPhe: 2.877 ± 0.566
7.799AlaGly: 7.799 ± 1.136
1.817AlaHis: 1.817 ± 0.335
5.679AlaIle: 5.679 ± 1.081
5.225AlaLys: 5.225 ± 0.79
8.254AlaLeu: 8.254 ± 1.031
2.65AlaMet: 2.65 ± 0.508
3.029AlaAsn: 3.029 ± 0.398
4.165AlaPro: 4.165 ± 0.662
3.938AlaGln: 3.938 ± 0.644
5.604AlaArg: 5.604 ± 0.801
5.376AlaSer: 5.376 ± 0.737
6.436AlaThr: 6.436 ± 0.697
6.739AlaVal: 6.739 ± 0.583
1.969AlaTrp: 1.969 ± 0.464
2.726AlaTyr: 2.726 ± 0.537
0.0AlaXaa: 0.0 ± 0.0
Cys
0.379CysAla: 0.379 ± 0.154
0.0CysCys: 0.0 ± 0.0
0.303CysAsp: 0.303 ± 0.146
0.303CysGlu: 0.303 ± 0.198
0.151CysPhe: 0.151 ± 0.115
0.757CysGly: 0.757 ± 0.246
0.151CysHis: 0.151 ± 0.119
0.0CysIle: 0.0 ± 0.0
0.53CysLys: 0.53 ± 0.219
0.606CysLeu: 0.606 ± 0.298
0.076CysMet: 0.076 ± 0.078
0.151CysAsn: 0.151 ± 0.11
0.606CysPro: 0.606 ± 0.228
0.076CysGln: 0.076 ± 0.078
0.379CysArg: 0.379 ± 0.158
0.379CysSer: 0.379 ± 0.152
0.454CysThr: 0.454 ± 0.183
0.53CysVal: 0.53 ± 0.173
0.227CysTrp: 0.227 ± 0.129
0.454CysTyr: 0.454 ± 0.2
0.0CysXaa: 0.0 ± 0.0
Asp
4.619AspAla: 4.619 ± 0.59
0.757AspCys: 0.757 ± 0.248
4.846AspAsp: 4.846 ± 0.859
6.058AspGlu: 6.058 ± 1.298
2.196AspPhe: 2.196 ± 0.373
4.392AspGly: 4.392 ± 0.564
1.136AspHis: 1.136 ± 0.238
3.332AspIle: 3.332 ± 0.398
2.726AspLys: 2.726 ± 0.446
5.376AspLeu: 5.376 ± 0.767
1.287AspMet: 1.287 ± 0.355
1.514AspAsn: 1.514 ± 0.447
4.468AspPro: 4.468 ± 0.709
1.817AspGln: 1.817 ± 0.413
3.408AspArg: 3.408 ± 0.501
3.256AspSer: 3.256 ± 0.447
2.802AspThr: 2.802 ± 0.538
4.543AspVal: 4.543 ± 0.583
2.196AspTrp: 2.196 ± 0.416
2.802AspTyr: 2.802 ± 0.449
0.0AspXaa: 0.0 ± 0.0
Glu
8.481GluAla: 8.481 ± 0.965
0.303GluCys: 0.303 ± 0.187
4.695GluAsp: 4.695 ± 1.055
5.604GluGlu: 5.604 ± 1.258
2.045GluPhe: 2.045 ± 0.364
4.316GluGly: 4.316 ± 0.651
1.136GluHis: 1.136 ± 0.318
2.802GluIle: 2.802 ± 0.513
2.12GluLys: 2.12 ± 0.575
5.528GluLeu: 5.528 ± 0.674
2.347GluMet: 2.347 ± 0.505
1.893GluAsn: 1.893 ± 0.472
2.953GluPro: 2.953 ± 0.5
2.65GluGln: 2.65 ± 0.577
3.559GluArg: 3.559 ± 0.531
3.029GluSer: 3.029 ± 0.592
3.559GluThr: 3.559 ± 0.523
5.452GluVal: 5.452 ± 0.603
1.439GluTrp: 1.439 ± 0.321
1.893GluTyr: 1.893 ± 0.3
0.0GluXaa: 0.0 ± 0.0
Phe
2.575PheAla: 2.575 ± 0.393
0.151PheCys: 0.151 ± 0.098
2.499PheAsp: 2.499 ± 0.464
2.045PheGlu: 2.045 ± 0.355
0.606PhePhe: 0.606 ± 0.183
3.483PheGly: 3.483 ± 0.553
0.757PheHis: 0.757 ± 0.228
1.363PheIle: 1.363 ± 0.328
1.666PheLys: 1.666 ± 0.343
2.045PheLeu: 2.045 ± 0.496
0.53PheMet: 0.53 ± 0.298
1.136PheAsn: 1.136 ± 0.263
1.212PhePro: 1.212 ± 0.297
1.136PheGln: 1.136 ± 0.322
2.65PheArg: 2.65 ± 0.481
1.59PheSer: 1.59 ± 0.27
2.347PheThr: 2.347 ± 0.406
1.666PheVal: 1.666 ± 0.331
0.682PheTrp: 0.682 ± 0.299
0.757PheTyr: 0.757 ± 0.294
0.0PheXaa: 0.0 ± 0.0
Gly
6.664GlyAla: 6.664 ± 0.912
0.757GlyCys: 0.757 ± 0.238
4.089GlyAsp: 4.089 ± 0.538
3.786GlyGlu: 3.786 ± 0.468
2.953GlyPhe: 2.953 ± 0.423
5.073GlyGly: 5.073 ± 0.81
1.969GlyHis: 1.969 ± 0.373
5.149GlyIle: 5.149 ± 1.089
5.149GlyLys: 5.149 ± 0.765
6.058GlyLeu: 6.058 ± 0.728
2.196GlyMet: 2.196 ± 0.362
2.423GlyAsn: 2.423 ± 0.372
3.635GlyPro: 3.635 ± 0.526
3.635GlyGln: 3.635 ± 0.716
4.468GlyArg: 4.468 ± 0.787
5.301GlySer: 5.301 ± 0.651
5.906GlyThr: 5.906 ± 0.923
6.361GlyVal: 6.361 ± 0.808
1.59GlyTrp: 1.59 ± 0.331
2.196GlyTyr: 2.196 ± 0.6
0.0GlyXaa: 0.0 ± 0.0
His
1.06HisAla: 1.06 ± 0.261
0.151HisCys: 0.151 ± 0.116
0.984HisAsp: 0.984 ± 0.254
1.363HisGlu: 1.363 ± 0.306
0.833HisPhe: 0.833 ± 0.257
1.817HisGly: 1.817 ± 0.367
0.379HisHis: 0.379 ± 0.172
0.757HisIle: 0.757 ± 0.283
1.363HisLys: 1.363 ± 0.352
1.136HisLeu: 1.136 ± 0.307
0.379HisMet: 0.379 ± 0.177
0.757HisAsn: 0.757 ± 0.249
0.606HisPro: 0.606 ± 0.227
0.303HisGln: 0.303 ± 0.159
0.682HisArg: 0.682 ± 0.232
1.893HisSer: 1.893 ± 0.334
1.136HisThr: 1.136 ± 0.271
1.817HisVal: 1.817 ± 0.394
0.454HisTrp: 0.454 ± 0.159
0.909HisTyr: 0.909 ± 0.214
0.0HisXaa: 0.0 ± 0.0
Ile
5.301IleAla: 5.301 ± 0.587
0.303IleCys: 0.303 ± 0.168
4.846IleAsp: 4.846 ± 0.525
3.332IleGlu: 3.332 ± 0.56
0.606IlePhe: 0.606 ± 0.283
4.24IleGly: 4.24 ± 0.749
0.682IleHis: 0.682 ± 0.221
3.18IleIle: 3.18 ± 0.868
2.499IleLys: 2.499 ± 0.495
2.953IleLeu: 2.953 ± 0.432
1.287IleMet: 1.287 ± 0.372
1.817IleAsn: 1.817 ± 0.428
2.877IlePro: 2.877 ± 0.464
2.499IleGln: 2.499 ± 0.517
2.347IleArg: 2.347 ± 0.512
3.408IleSer: 3.408 ± 0.552
3.332IleThr: 3.332 ± 0.708
2.726IleVal: 2.726 ± 0.605
0.757IleTrp: 0.757 ± 0.253
1.514IleTyr: 1.514 ± 0.34
0.0IleXaa: 0.0 ± 0.0
Lys
4.922LysAla: 4.922 ± 0.76
0.303LysCys: 0.303 ± 0.135
2.423LysAsp: 2.423 ± 0.491
3.332LysGlu: 3.332 ± 0.537
1.06LysPhe: 1.06 ± 0.247
3.483LysGly: 3.483 ± 0.474
0.984LysHis: 0.984 ± 0.294
2.045LysIle: 2.045 ± 0.438
2.347LysLys: 2.347 ± 0.551
4.316LysLeu: 4.316 ± 0.595
1.363LysMet: 1.363 ± 0.363
1.666LysAsn: 1.666 ± 0.369
3.408LysPro: 3.408 ± 0.641
2.196LysGln: 2.196 ± 0.55
2.802LysArg: 2.802 ± 0.462
2.272LysSer: 2.272 ± 0.43
2.65LysThr: 2.65 ± 0.422
3.483LysVal: 3.483 ± 0.545
1.06LysTrp: 1.06 ± 0.27
1.287LysTyr: 1.287 ± 0.305
0.0LysXaa: 0.0 ± 0.0
Leu
8.254LeuAla: 8.254 ± 0.888
0.606LeuCys: 0.606 ± 0.201
6.209LeuAsp: 6.209 ± 0.535
6.058LeuGlu: 6.058 ± 0.854
2.12LeuPhe: 2.12 ± 0.375
6.134LeuGly: 6.134 ± 0.618
1.212LeuHis: 1.212 ± 0.361
4.922LeuIle: 4.922 ± 1.035
4.468LeuLys: 4.468 ± 0.626
8.178LeuLeu: 8.178 ± 0.788
2.12LeuMet: 2.12 ± 0.378
2.877LeuAsn: 2.877 ± 0.398
4.089LeuPro: 4.089 ± 0.518
3.105LeuGln: 3.105 ± 0.48
5.755LeuArg: 5.755 ± 0.634
4.392LeuSer: 4.392 ± 0.495
5.301LeuThr: 5.301 ± 0.559
6.285LeuVal: 6.285 ± 0.673
1.287LeuTrp: 1.287 ± 0.262
1.893LeuTyr: 1.893 ± 0.339
0.0LeuXaa: 0.0 ± 0.0
Met
2.877MetAla: 2.877 ± 0.413
0.227MetCys: 0.227 ± 0.122
1.666MetAsp: 1.666 ± 0.36
1.06MetGlu: 1.06 ± 0.269
0.757MetPhe: 0.757 ± 0.21
1.893MetGly: 1.893 ± 0.487
0.151MetHis: 0.151 ± 0.083
0.984MetIle: 0.984 ± 0.291
0.379MetLys: 0.379 ± 0.15
2.65MetLeu: 2.65 ± 0.582
0.682MetMet: 0.682 ± 0.176
0.833MetAsn: 0.833 ± 0.24
1.59MetPro: 1.59 ± 0.301
0.757MetGln: 0.757 ± 0.24
0.682MetArg: 0.682 ± 0.185
3.105MetSer: 3.105 ± 0.496
2.347MetThr: 2.347 ± 0.38
1.893MetVal: 1.893 ± 0.37
0.151MetTrp: 0.151 ± 0.132
0.379MetTyr: 0.379 ± 0.15
0.0MetXaa: 0.0 ± 0.0
Asn
2.953AsnAla: 2.953 ± 0.646
0.076AsnCys: 0.076 ± 0.078
1.817AsnAsp: 1.817 ± 0.322
1.817AsnGlu: 1.817 ± 0.398
0.757AsnPhe: 0.757 ± 0.241
3.18AsnGly: 3.18 ± 0.42
0.606AsnHis: 0.606 ± 0.168
1.969AsnIle: 1.969 ± 0.451
1.514AsnLys: 1.514 ± 0.354
2.953AsnLeu: 2.953 ± 0.405
0.454AsnMet: 0.454 ± 0.179
1.136AsnAsn: 1.136 ± 0.318
1.817AsnPro: 1.817 ± 0.381
1.287AsnGln: 1.287 ± 0.359
1.742AsnArg: 1.742 ± 0.433
2.045AsnSer: 2.045 ± 0.439
1.666AsnThr: 1.666 ± 0.392
1.666AsnVal: 1.666 ± 0.345
0.682AsnTrp: 0.682 ± 0.244
1.212AsnTyr: 1.212 ± 0.339
0.0AsnXaa: 0.0 ± 0.0
Pro
5.755ProAla: 5.755 ± 0.835
0.0ProCys: 0.0 ± 0.0
3.18ProAsp: 3.18 ± 0.528
4.013ProGlu: 4.013 ± 0.752
1.969ProPhe: 1.969 ± 0.419
4.013ProGly: 4.013 ± 0.554
0.682ProHis: 0.682 ± 0.249
2.045ProIle: 2.045 ± 0.356
1.59ProLys: 1.59 ± 0.357
3.938ProLeu: 3.938 ± 0.44
1.136ProMet: 1.136 ± 0.266
1.666ProAsn: 1.666 ± 0.484
1.287ProPro: 1.287 ± 0.346
2.499ProGln: 2.499 ± 0.424
1.969ProArg: 1.969 ± 0.51
2.802ProSer: 2.802 ± 0.376
3.938ProThr: 3.938 ± 0.65
4.771ProVal: 4.771 ± 0.605
0.682ProTrp: 0.682 ± 0.197
1.212ProTyr: 1.212 ± 0.272
0.0ProXaa: 0.0 ± 0.0
Gln
4.316GlnAla: 4.316 ± 0.739
0.227GlnCys: 0.227 ± 0.121
1.666GlnAsp: 1.666 ± 0.385
3.18GlnGlu: 3.18 ± 0.633
0.757GlnPhe: 0.757 ± 0.258
3.408GlnGly: 3.408 ± 0.555
1.363GlnHis: 1.363 ± 0.352
1.439GlnIle: 1.439 ± 0.313
1.06GlnLys: 1.06 ± 0.241
4.013GlnLeu: 4.013 ± 0.673
0.833GlnMet: 0.833 ± 0.262
1.514GlnAsn: 1.514 ± 0.364
1.742GlnPro: 1.742 ± 0.432
2.272GlnGln: 2.272 ± 0.564
2.726GlnArg: 2.726 ± 0.512
2.12GlnSer: 2.12 ± 0.409
2.575GlnThr: 2.575 ± 0.376
2.347GlnVal: 2.347 ± 0.418
1.287GlnTrp: 1.287 ± 0.363
1.06GlnTyr: 1.06 ± 0.268
0.0GlnXaa: 0.0 ± 0.0
Arg
4.846ArgAla: 4.846 ± 0.609
0.454ArgCys: 0.454 ± 0.188
3.786ArgAsp: 3.786 ± 0.456
3.18ArgGlu: 3.18 ± 0.504
2.272ArgPhe: 2.272 ± 0.495
3.559ArgGly: 3.559 ± 0.465
1.136ArgHis: 1.136 ± 0.367
2.65ArgIle: 2.65 ± 0.414
3.256ArgLys: 3.256 ± 0.628
5.225ArgLeu: 5.225 ± 0.679
1.969ArgMet: 1.969 ± 0.37
1.439ArgAsn: 1.439 ± 0.373
2.802ArgPro: 2.802 ± 0.527
2.196ArgGln: 2.196 ± 0.357
4.165ArgArg: 4.165 ± 0.663
4.392ArgSer: 4.392 ± 0.671
2.953ArgThr: 2.953 ± 0.639
4.468ArgVal: 4.468 ± 0.591
0.833ArgTrp: 0.833 ± 0.224
1.514ArgTyr: 1.514 ± 0.401
0.0ArgXaa: 0.0 ± 0.0
Ser
5.831SerAla: 5.831 ± 0.722
0.227SerCys: 0.227 ± 0.168
3.71SerAsp: 3.71 ± 0.483
3.029SerGlu: 3.029 ± 0.538
2.499SerPhe: 2.499 ± 0.489
5.301SerGly: 5.301 ± 0.806
0.984SerHis: 0.984 ± 0.3
3.029SerIle: 3.029 ± 0.612
3.18SerLys: 3.18 ± 0.53
5.679SerLeu: 5.679 ± 0.612
1.969SerMet: 1.969 ± 0.53
1.742SerAsn: 1.742 ± 0.413
2.575SerPro: 2.575 ± 0.337
2.65SerGln: 2.65 ± 0.368
3.105SerArg: 3.105 ± 0.543
3.862SerSer: 3.862 ± 0.611
3.559SerThr: 3.559 ± 0.527
4.695SerVal: 4.695 ± 0.539
1.363SerTrp: 1.363 ± 0.382
1.893SerTyr: 1.893 ± 0.361
0.0SerXaa: 0.0 ± 0.0
Thr
5.528ThrAla: 5.528 ± 1.022
0.303ThrCys: 0.303 ± 0.147
3.18ThrAsp: 3.18 ± 0.42
3.786ThrGlu: 3.786 ± 0.539
3.105ThrPhe: 3.105 ± 0.499
5.755ThrGly: 5.755 ± 0.612
1.06ThrHis: 1.06 ± 0.302
3.256ThrIle: 3.256 ± 0.46
2.575ThrLys: 2.575 ± 0.446
6.134ThrLeu: 6.134 ± 0.667
1.514ThrMet: 1.514 ± 0.367
1.06ThrAsn: 1.06 ± 0.323
3.256ThrPro: 3.256 ± 0.465
2.272ThrGln: 2.272 ± 0.394
3.559ThrArg: 3.559 ± 0.47
3.635ThrSer: 3.635 ± 0.592
4.846ThrThr: 4.846 ± 0.698
5.755ThrVal: 5.755 ± 0.76
1.287ThrTrp: 1.287 ± 0.323
2.12ThrTyr: 2.12 ± 0.444
0.0ThrXaa: 0.0 ± 0.0
Val
7.799ValAla: 7.799 ± 0.81
0.53ValCys: 0.53 ± 0.232
4.695ValAsp: 4.695 ± 0.629
4.392ValGlu: 4.392 ± 0.578
1.742ValPhe: 1.742 ± 0.346
5.679ValGly: 5.679 ± 0.772
1.59ValHis: 1.59 ± 0.384
3.408ValIle: 3.408 ± 0.526
3.635ValLys: 3.635 ± 0.609
5.831ValLeu: 5.831 ± 0.677
1.59ValMet: 1.59 ± 0.408
3.18ValAsn: 3.18 ± 0.504
3.559ValPro: 3.559 ± 0.535
3.105ValGln: 3.105 ± 0.415
4.316ValArg: 4.316 ± 0.57
4.619ValSer: 4.619 ± 0.622
5.225ValThr: 5.225 ± 0.674
5.528ValVal: 5.528 ± 0.668
1.742ValTrp: 1.742 ± 0.455
2.65ValTyr: 2.65 ± 0.4
0.0ValXaa: 0.0 ± 0.0
Trp
1.287TrpAla: 1.287 ± 0.369
0.227TrpCys: 0.227 ± 0.136
1.893TrpAsp: 1.893 ± 0.386
1.212TrpGlu: 1.212 ± 0.271
0.757TrpPhe: 0.757 ± 0.233
1.666TrpGly: 1.666 ± 0.414
0.53TrpHis: 0.53 ± 0.162
1.06TrpIle: 1.06 ± 0.288
0.757TrpLys: 0.757 ± 0.202
2.196TrpLeu: 2.196 ± 0.401
0.227TrpMet: 0.227 ± 0.13
0.606TrpAsn: 0.606 ± 0.171
1.06TrpPro: 1.06 ± 0.329
0.757TrpGln: 0.757 ± 0.286
1.06TrpArg: 1.06 ± 0.25
0.984TrpSer: 0.984 ± 0.321
1.59TrpThr: 1.59 ± 0.335
1.666TrpVal: 1.666 ± 0.366
0.682TrpTrp: 0.682 ± 0.319
1.136TrpTyr: 1.136 ± 0.296
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.802TyrAla: 2.802 ± 0.447
0.227TyrCys: 0.227 ± 0.144
2.196TyrAsp: 2.196 ± 0.375
1.817TyrGlu: 1.817 ± 0.377
0.833TyrPhe: 0.833 ± 0.252
3.408TyrGly: 3.408 ± 0.624
0.454TyrHis: 0.454 ± 0.184
1.439TyrIle: 1.439 ± 0.278
1.363TyrLys: 1.363 ± 0.321
2.196TyrLeu: 2.196 ± 0.35
0.379TyrMet: 0.379 ± 0.185
0.909TyrAsn: 0.909 ± 0.287
1.514TyrPro: 1.514 ± 0.365
0.757TyrGln: 0.757 ± 0.242
2.12TyrArg: 2.12 ± 0.415
2.423TyrSer: 2.423 ± 0.405
1.363TyrThr: 1.363 ± 0.252
2.423TyrVal: 2.423 ± 0.556
0.984TyrTrp: 0.984 ± 0.272
1.212TyrTyr: 1.212 ± 0.331
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 63 proteins (13207 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski