Amino acid dipepetide frequency for Streptococcus phage phi5218

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.14AlaAla: 3.14 ± 0.721
0.611AlaCys: 0.611 ± 0.204
5.058AlaAsp: 5.058 ± 0.653
5.407AlaGlu: 5.407 ± 0.561
2.355AlaPhe: 2.355 ± 0.5
5.233AlaGly: 5.233 ± 1.016
1.134AlaHis: 1.134 ± 0.389
4.971AlaIle: 4.971 ± 0.858
5.756AlaLys: 5.756 ± 0.623
6.105AlaLeu: 6.105 ± 0.626
2.442AlaMet: 2.442 ± 0.485
4.448AlaAsn: 4.448 ± 0.56
1.308AlaPro: 1.308 ± 0.285
3.14AlaGln: 3.14 ± 0.465
2.355AlaArg: 2.355 ± 0.4
4.099AlaSer: 4.099 ± 0.582
4.71AlaThr: 4.71 ± 0.672
3.75AlaVal: 3.75 ± 0.432
0.959AlaTrp: 0.959 ± 0.303
2.006AlaTyr: 2.006 ± 0.391
0.0AlaXaa: 0.0 ± 0.0
Cys
0.436CysAla: 0.436 ± 0.215
0.174CysCys: 0.174 ± 0.131
0.436CysAsp: 0.436 ± 0.168
0.436CysGlu: 0.436 ± 0.178
0.0CysPhe: 0.0 ± 0.0
0.611CysGly: 0.611 ± 0.267
0.262CysHis: 0.262 ± 0.155
0.0CysIle: 0.0 ± 0.0
0.523CysLys: 0.523 ± 0.254
0.698CysLeu: 0.698 ± 0.239
0.0CysMet: 0.0 ± 0.0
0.349CysAsn: 0.349 ± 0.179
0.436CysPro: 0.436 ± 0.211
0.087CysGln: 0.087 ± 0.077
0.611CysArg: 0.611 ± 0.259
0.174CysSer: 0.174 ± 0.102
0.698CysThr: 0.698 ± 0.258
0.174CysVal: 0.174 ± 0.113
0.174CysTrp: 0.174 ± 0.104
0.262CysTyr: 0.262 ± 0.145
0.0CysXaa: 0.0 ± 0.0
Asp
2.616AspAla: 2.616 ± 0.453
0.611AspCys: 0.611 ± 0.28
3.75AspAsp: 3.75 ± 0.583
5.058AspGlu: 5.058 ± 0.837
3.14AspPhe: 3.14 ± 0.51
5.058AspGly: 5.058 ± 0.599
0.785AspHis: 0.785 ± 0.239
4.012AspIle: 4.012 ± 0.571
4.71AspLys: 4.71 ± 0.782
5.407AspLeu: 5.407 ± 0.702
2.093AspMet: 2.093 ± 0.393
3.401AspAsn: 3.401 ± 0.446
0.872AspPro: 0.872 ± 0.295
1.483AspGln: 1.483 ± 0.296
2.704AspArg: 2.704 ± 0.493
3.576AspSer: 3.576 ± 0.539
2.878AspThr: 2.878 ± 0.447
4.361AspVal: 4.361 ± 0.509
1.047AspTrp: 1.047 ± 0.345
2.268AspTyr: 2.268 ± 0.584
0.0AspXaa: 0.0 ± 0.0
Glu
5.756GluAla: 5.756 ± 0.559
0.436GluCys: 0.436 ± 0.202
4.274GluAsp: 4.274 ± 0.564
5.582GluGlu: 5.582 ± 0.772
3.227GluPhe: 3.227 ± 0.457
2.355GluGly: 2.355 ± 0.38
1.221GluHis: 1.221 ± 0.37
6.541GluIle: 6.541 ± 0.758
4.884GluLys: 4.884 ± 0.773
9.332GluLeu: 9.332 ± 1.047
2.965GluMet: 2.965 ± 0.528
4.012GluAsn: 4.012 ± 0.542
2.529GluPro: 2.529 ± 0.468
3.837GluGln: 3.837 ± 0.526
4.535GluArg: 4.535 ± 0.611
3.663GluSer: 3.663 ± 0.49
3.837GluThr: 3.837 ± 0.475
5.233GluVal: 5.233 ± 0.608
0.959GluTrp: 0.959 ± 0.282
2.442GluTyr: 2.442 ± 0.386
0.0GluXaa: 0.0 ± 0.0
Phe
3.314PheAla: 3.314 ± 0.49
0.0PheCys: 0.0 ± 0.0
3.837PheAsp: 3.837 ± 0.488
4.012PheGlu: 4.012 ± 0.552
1.657PhePhe: 1.657 ± 0.329
2.791PheGly: 2.791 ± 0.498
0.349PheHis: 0.349 ± 0.189
2.442PheIle: 2.442 ± 0.425
3.314PheLys: 3.314 ± 0.623
3.053PheLeu: 3.053 ± 0.624
1.047PheMet: 1.047 ± 0.292
2.878PheAsn: 2.878 ± 0.491
0.698PhePro: 0.698 ± 0.217
1.221PheGln: 1.221 ± 0.274
1.657PheArg: 1.657 ± 0.303
1.483PheSer: 1.483 ± 0.367
2.093PheThr: 2.093 ± 0.471
2.442PheVal: 2.442 ± 0.435
0.436PheTrp: 0.436 ± 0.171
1.308PheTyr: 1.308 ± 0.32
0.0PheXaa: 0.0 ± 0.0
Gly
2.791GlyAla: 2.791 ± 0.761
0.523GlyCys: 0.523 ± 0.208
2.965GlyAsp: 2.965 ± 0.355
3.314GlyGlu: 3.314 ± 0.597
2.529GlyPhe: 2.529 ± 0.427
3.227GlyGly: 3.227 ± 0.631
1.395GlyHis: 1.395 ± 0.356
4.71GlyIle: 4.71 ± 0.775
5.843GlyLys: 5.843 ± 0.753
5.582GlyLeu: 5.582 ± 0.797
1.832GlyMet: 1.832 ± 0.337
3.75GlyAsn: 3.75 ± 0.592
1.221GlyPro: 1.221 ± 0.405
3.314GlyGln: 3.314 ± 0.604
3.314GlyArg: 3.314 ± 0.425
3.925GlySer: 3.925 ± 0.823
4.099GlyThr: 4.099 ± 0.818
3.75GlyVal: 3.75 ± 0.699
0.959GlyTrp: 0.959 ± 0.229
3.576GlyTyr: 3.576 ± 0.475
0.0GlyXaa: 0.0 ± 0.0
His
1.134HisAla: 1.134 ± 0.274
0.174HisCys: 0.174 ± 0.124
0.523HisAsp: 0.523 ± 0.261
1.483HisGlu: 1.483 ± 0.411
0.959HisPhe: 0.959 ± 0.303
0.959HisGly: 0.959 ± 0.249
0.349HisHis: 0.349 ± 0.162
0.872HisIle: 0.872 ± 0.268
1.221HisLys: 1.221 ± 0.329
1.657HisLeu: 1.657 ± 0.439
0.262HisMet: 0.262 ± 0.124
1.047HisAsn: 1.047 ± 0.321
1.047HisPro: 1.047 ± 0.375
1.134HisGln: 1.134 ± 0.285
1.308HisArg: 1.308 ± 0.336
0.785HisSer: 0.785 ± 0.193
1.221HisThr: 1.221 ± 0.291
0.611HisVal: 0.611 ± 0.292
0.262HisTrp: 0.262 ± 0.118
0.872HisTyr: 0.872 ± 0.338
0.0HisXaa: 0.0 ± 0.0
Ile
4.361IleAla: 4.361 ± 0.414
0.436IleCys: 0.436 ± 0.258
4.535IleAsp: 4.535 ± 0.523
5.407IleGlu: 5.407 ± 0.655
1.744IlePhe: 1.744 ± 0.314
4.797IleGly: 4.797 ± 0.614
0.698IleHis: 0.698 ± 0.185
3.925IleIle: 3.925 ± 0.736
5.669IleLys: 5.669 ± 0.655
6.367IleLeu: 6.367 ± 0.85
1.483IleMet: 1.483 ± 0.406
3.837IleAsn: 3.837 ± 0.546
2.616IlePro: 2.616 ± 0.487
2.18IleGln: 2.18 ± 0.514
2.878IleArg: 2.878 ± 0.429
4.361IleSer: 4.361 ± 0.635
4.099IleThr: 4.099 ± 0.749
3.314IleVal: 3.314 ± 0.535
0.349IleTrp: 0.349 ± 0.167
3.227IleTyr: 3.227 ± 0.36
0.0IleXaa: 0.0 ± 0.0
Lys
6.279LysAla: 6.279 ± 0.659
0.436LysCys: 0.436 ± 0.221
3.663LysAsp: 3.663 ± 0.553
7.413LysGlu: 7.413 ± 1.699
2.791LysPhe: 2.791 ± 0.392
4.274LysGly: 4.274 ± 0.704
2.18LysHis: 2.18 ± 0.482
5.233LysIle: 5.233 ± 0.678
5.582LysLys: 5.582 ± 0.927
5.669LysLeu: 5.669 ± 0.637
2.616LysMet: 2.616 ± 0.498
4.535LysAsn: 4.535 ± 0.656
2.442LysPro: 2.442 ± 0.422
3.576LysGln: 3.576 ± 0.586
4.361LysArg: 4.361 ± 0.637
4.274LysSer: 4.274 ± 0.74
5.146LysThr: 5.146 ± 0.779
5.146LysVal: 5.146 ± 0.736
0.785LysTrp: 0.785 ± 0.214
3.14LysTyr: 3.14 ± 0.555
0.0LysXaa: 0.0 ± 0.0
Leu
7.152LeuAla: 7.152 ± 0.82
0.349LeuCys: 0.349 ± 0.197
5.32LeuAsp: 5.32 ± 0.597
6.977LeuGlu: 6.977 ± 0.749
4.448LeuPhe: 4.448 ± 0.612
5.146LeuGly: 5.146 ± 0.688
1.657LeuHis: 1.657 ± 0.35
4.622LeuIle: 4.622 ± 0.963
8.809LeuLys: 8.809 ± 1.065
8.721LeuLeu: 8.721 ± 1.003
1.047LeuMet: 1.047 ± 0.241
4.274LeuAsn: 4.274 ± 0.489
2.616LeuPro: 2.616 ± 0.692
3.401LeuGln: 3.401 ± 0.561
3.489LeuArg: 3.489 ± 0.578
6.977LeuSer: 6.977 ± 1.026
6.018LeuThr: 6.018 ± 0.583
5.495LeuVal: 5.495 ± 0.838
0.698LeuTrp: 0.698 ± 0.237
1.919LeuTyr: 1.919 ± 0.501
0.0LeuXaa: 0.0 ± 0.0
Met
2.093MetAla: 2.093 ± 0.438
0.262MetCys: 0.262 ± 0.144
1.047MetAsp: 1.047 ± 0.229
1.57MetGlu: 1.57 ± 0.332
0.785MetPhe: 0.785 ± 0.221
1.221MetGly: 1.221 ± 0.492
0.262MetHis: 0.262 ± 0.134
1.395MetIle: 1.395 ± 0.374
2.268MetLys: 2.268 ± 0.495
2.355MetLeu: 2.355 ± 0.43
0.349MetMet: 0.349 ± 0.223
1.308MetAsn: 1.308 ± 0.408
0.698MetPro: 0.698 ± 0.255
1.744MetGln: 1.744 ± 0.379
0.611MetArg: 0.611 ± 0.184
1.657MetSer: 1.657 ± 0.397
3.14MetThr: 3.14 ± 0.571
1.832MetVal: 1.832 ± 0.303
0.349MetTrp: 0.349 ± 0.147
1.134MetTyr: 1.134 ± 0.309
0.0MetXaa: 0.0 ± 0.0
Asn
3.837AsnAla: 3.837 ± 0.624
0.349AsnCys: 0.349 ± 0.149
2.616AsnAsp: 2.616 ± 0.585
3.489AsnGlu: 3.489 ± 0.625
1.657AsnPhe: 1.657 ± 0.44
4.797AsnGly: 4.797 ± 0.817
0.698AsnHis: 0.698 ± 0.25
4.361AsnIle: 4.361 ± 0.536
3.227AsnLys: 3.227 ± 0.611
5.058AsnLeu: 5.058 ± 0.623
0.698AsnMet: 0.698 ± 0.253
2.355AsnAsn: 2.355 ± 0.546
2.442AsnPro: 2.442 ± 0.53
2.529AsnGln: 2.529 ± 0.453
2.616AsnArg: 2.616 ± 0.493
3.314AsnSer: 3.314 ± 0.831
2.878AsnThr: 2.878 ± 0.608
2.965AsnVal: 2.965 ± 0.369
0.698AsnTrp: 0.698 ± 0.288
2.616AsnTyr: 2.616 ± 0.588
0.0AsnXaa: 0.0 ± 0.0
Pro
1.221ProAla: 1.221 ± 0.262
0.0ProCys: 0.0 ± 0.0
2.093ProAsp: 2.093 ± 0.462
2.355ProGlu: 2.355 ± 0.48
2.006ProPhe: 2.006 ± 0.449
0.959ProGly: 0.959 ± 0.274
0.611ProHis: 0.611 ± 0.316
1.919ProIle: 1.919 ± 0.423
2.965ProLys: 2.965 ± 0.627
2.006ProLeu: 2.006 ± 0.404
0.959ProMet: 0.959 ± 0.245
1.134ProAsn: 1.134 ± 0.274
1.134ProPro: 1.134 ± 0.368
1.221ProGln: 1.221 ± 0.396
1.57ProArg: 1.57 ± 0.569
1.308ProSer: 1.308 ± 0.306
1.57ProThr: 1.57 ± 0.513
2.093ProVal: 2.093 ± 0.346
0.436ProTrp: 0.436 ± 0.196
1.134ProTyr: 1.134 ± 0.302
0.0ProXaa: 0.0 ± 0.0
Gln
3.663GlnAla: 3.663 ± 0.607
0.174GlnCys: 0.174 ± 0.131
2.093GlnAsp: 2.093 ± 0.459
3.401GlnGlu: 3.401 ± 0.579
1.657GlnPhe: 1.657 ± 0.39
2.355GlnGly: 2.355 ± 0.596
0.523GlnHis: 0.523 ± 0.203
2.791GlnIle: 2.791 ± 0.593
3.314GlnLys: 3.314 ± 0.53
3.925GlnLeu: 3.925 ± 0.551
0.698GlnMet: 0.698 ± 0.246
2.442GlnAsn: 2.442 ± 0.393
0.698GlnPro: 0.698 ± 0.278
2.704GlnGln: 2.704 ± 0.404
2.442GlnArg: 2.442 ± 0.378
2.704GlnSer: 2.704 ± 0.389
3.227GlnThr: 3.227 ± 0.69
2.965GlnVal: 2.965 ± 0.549
0.523GlnTrp: 0.523 ± 0.17
1.657GlnTyr: 1.657 ± 0.403
0.0GlnXaa: 0.0 ± 0.0
Arg
2.704ArgAla: 2.704 ± 0.463
0.523ArgCys: 0.523 ± 0.227
2.704ArgAsp: 2.704 ± 0.395
2.791ArgGlu: 2.791 ± 0.515
1.832ArgPhe: 1.832 ± 0.406
2.18ArgGly: 2.18 ± 0.411
0.959ArgHis: 0.959 ± 0.246
3.576ArgIle: 3.576 ± 0.554
4.535ArgLys: 4.535 ± 0.695
4.448ArgLeu: 4.448 ± 0.591
1.395ArgMet: 1.395 ± 0.307
2.704ArgAsn: 2.704 ± 0.462
1.57ArgPro: 1.57 ± 0.391
2.442ArgGln: 2.442 ± 0.452
1.919ArgArg: 1.919 ± 0.351
2.442ArgSer: 2.442 ± 0.494
2.965ArgThr: 2.965 ± 0.45
2.878ArgVal: 2.878 ± 0.522
0.785ArgTrp: 0.785 ± 0.235
1.744ArgTyr: 1.744 ± 0.423
0.0ArgXaa: 0.0 ± 0.0
Ser
5.146SerAla: 5.146 ± 1.242
0.174SerCys: 0.174 ± 0.107
2.529SerAsp: 2.529 ± 0.4
5.233SerGlu: 5.233 ± 0.721
2.268SerPhe: 2.268 ± 0.47
3.925SerGly: 3.925 ± 0.687
1.308SerHis: 1.308 ± 0.267
4.622SerIle: 4.622 ± 0.693
3.401SerLys: 3.401 ± 0.481
4.797SerLeu: 4.797 ± 0.761
2.18SerMet: 2.18 ± 0.434
2.878SerAsn: 2.878 ± 0.507
1.221SerPro: 1.221 ± 0.286
2.355SerGln: 2.355 ± 0.652
2.442SerArg: 2.442 ± 0.448
3.489SerSer: 3.489 ± 0.56
3.489SerThr: 3.489 ± 0.629
4.186SerVal: 4.186 ± 0.702
0.436SerTrp: 0.436 ± 0.189
2.616SerTyr: 2.616 ± 0.506
0.0SerXaa: 0.0 ± 0.0
Thr
4.971ThrAla: 4.971 ± 0.864
0.523ThrCys: 0.523 ± 0.19
3.837ThrAsp: 3.837 ± 0.703
4.099ThrGlu: 4.099 ± 0.584
2.616ThrPhe: 2.616 ± 0.524
5.931ThrGly: 5.931 ± 0.772
1.134ThrHis: 1.134 ± 0.297
3.837ThrIle: 3.837 ± 0.608
4.186ThrLys: 4.186 ± 0.52
5.407ThrLeu: 5.407 ± 0.629
1.047ThrMet: 1.047 ± 0.273
2.791ThrAsn: 2.791 ± 0.528
1.57ThrPro: 1.57 ± 0.316
2.878ThrGln: 2.878 ± 0.558
2.268ThrArg: 2.268 ± 0.379
3.576ThrSer: 3.576 ± 0.64
3.837ThrThr: 3.837 ± 0.529
5.146ThrVal: 5.146 ± 0.717
0.349ThrTrp: 0.349 ± 0.15
2.18ThrTyr: 2.18 ± 0.421
0.0ThrXaa: 0.0 ± 0.0
Val
4.448ValAla: 4.448 ± 0.461
0.174ValCys: 0.174 ± 0.121
5.146ValAsp: 5.146 ± 0.614
6.192ValGlu: 6.192 ± 0.778
2.18ValPhe: 2.18 ± 0.421
3.314ValGly: 3.314 ± 0.572
1.134ValHis: 1.134 ± 0.32
3.837ValIle: 3.837 ± 0.567
4.884ValLys: 4.884 ± 0.623
3.925ValLeu: 3.925 ± 0.728
1.483ValMet: 1.483 ± 0.328
2.268ValAsn: 2.268 ± 0.382
2.355ValPro: 2.355 ± 0.456
2.878ValGln: 2.878 ± 0.585
2.878ValArg: 2.878 ± 0.491
4.274ValSer: 4.274 ± 0.598
4.099ValThr: 4.099 ± 0.713
3.314ValVal: 3.314 ± 0.53
0.698ValTrp: 0.698 ± 0.274
2.616ValTyr: 2.616 ± 0.459
0.0ValXaa: 0.0 ± 0.0
Trp
0.959TrpAla: 0.959 ± 0.325
0.262TrpCys: 0.262 ± 0.132
0.523TrpAsp: 0.523 ± 0.218
0.785TrpGlu: 0.785 ± 0.212
0.349TrpPhe: 0.349 ± 0.152
1.047TrpGly: 1.047 ± 0.315
0.262TrpHis: 0.262 ± 0.122
0.698TrpIle: 0.698 ± 0.264
0.959TrpLys: 0.959 ± 0.275
1.308TrpLeu: 1.308 ± 0.285
0.174TrpMet: 0.174 ± 0.124
0.785TrpAsn: 0.785 ± 0.192
0.087TrpPro: 0.087 ± 0.077
0.174TrpGln: 0.174 ± 0.108
1.047TrpArg: 1.047 ± 0.361
0.523TrpSer: 0.523 ± 0.178
0.611TrpThr: 0.611 ± 0.24
0.523TrpVal: 0.523 ± 0.174
0.0TrpTrp: 0.0 ± 0.0
0.174TrpTyr: 0.174 ± 0.116
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.965TyrAla: 2.965 ± 0.532
0.349TyrCys: 0.349 ± 0.175
2.878TyrAsp: 2.878 ± 0.58
2.704TyrGlu: 2.704 ± 0.48
1.832TyrPhe: 1.832 ± 0.366
2.442TyrGly: 2.442 ± 0.336
0.959TyrHis: 0.959 ± 0.246
1.919TyrIle: 1.919 ± 0.429
3.489TyrLys: 3.489 ± 0.593
3.314TyrLeu: 3.314 ± 0.512
1.221TyrMet: 1.221 ± 0.361
2.006TyrAsn: 2.006 ± 0.363
1.134TyrPro: 1.134 ± 0.262
1.657TyrGln: 1.657 ± 0.274
2.093TyrArg: 2.093 ± 0.335
2.093TyrSer: 2.093 ± 0.379
1.57TyrThr: 1.57 ± 0.36
1.919TyrVal: 1.919 ± 0.32
0.349TyrTrp: 0.349 ± 0.191
1.395TyrTyr: 1.395 ± 0.339
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 64 proteins (11467 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski