Amino acid dipepetide frequency for Streptococcus phage 5093

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.867AlaAla: 5.867 ± 2.481
0.275AlaCys: 0.275 ± 0.149
3.484AlaAsp: 3.484 ± 0.566
4.767AlaGlu: 4.767 ± 0.709
2.842AlaPhe: 2.842 ± 0.849
5.226AlaGly: 5.226 ± 1.135
0.825AlaHis: 0.825 ± 0.267
7.334AlaIle: 7.334 ± 1.536
5.959AlaLys: 5.959 ± 0.881
7.059AlaLeu: 7.059 ± 1.109
3.3AlaMet: 3.3 ± 0.815
4.95AlaAsn: 4.95 ± 0.836
2.2AlaPro: 2.2 ± 0.534
3.484AlaGln: 3.484 ± 0.696
2.75AlaArg: 2.75 ± 0.517
4.125AlaSer: 4.125 ± 0.737
4.492AlaThr: 4.492 ± 1.266
4.584AlaVal: 4.584 ± 1.403
0.458AlaTrp: 0.458 ± 0.228
1.925AlaTyr: 1.925 ± 0.421
0.0AlaXaa: 0.0 ± 0.0
Cys
0.183CysAla: 0.183 ± 0.119
0.183CysCys: 0.183 ± 0.168
0.642CysAsp: 0.642 ± 0.224
0.55CysGlu: 0.55 ± 0.206
0.0CysPhe: 0.0 ± 0.0
0.275CysGly: 0.275 ± 0.14
0.183CysHis: 0.183 ± 0.141
0.183CysIle: 0.183 ± 0.091
0.458CysLys: 0.458 ± 0.221
0.458CysLeu: 0.458 ± 0.24
0.183CysMet: 0.183 ± 0.134
0.092CysAsn: 0.092 ± 0.092
0.092CysPro: 0.092 ± 0.087
0.092CysGln: 0.092 ± 0.092
0.092CysArg: 0.092 ± 0.078
0.55CysSer: 0.55 ± 0.215
0.092CysThr: 0.092 ± 0.097
0.275CysVal: 0.275 ± 0.161
0.183CysTrp: 0.183 ± 0.114
0.092CysTyr: 0.092 ± 0.09
0.0CysXaa: 0.0 ± 0.0
Asp
3.3AspAla: 3.3 ± 0.478
0.275AspCys: 0.275 ± 0.158
5.134AspAsp: 5.134 ± 1.036
4.492AspGlu: 4.492 ± 0.759
3.667AspPhe: 3.667 ± 0.75
4.125AspGly: 4.125 ± 0.749
0.825AspHis: 0.825 ± 0.295
4.4AspIle: 4.4 ± 0.674
5.317AspLys: 5.317 ± 0.859
3.85AspLeu: 3.85 ± 0.88
2.109AspMet: 2.109 ± 0.418
3.392AspAsn: 3.392 ± 0.613
1.283AspPro: 1.283 ± 0.423
1.834AspGln: 1.834 ± 0.392
2.842AspArg: 2.842 ± 0.692
3.484AspSer: 3.484 ± 0.683
3.117AspThr: 3.117 ± 0.465
2.567AspVal: 2.567 ± 0.526
0.55AspTrp: 0.55 ± 0.2
2.475AspTyr: 2.475 ± 0.487
0.0AspXaa: 0.0 ± 0.0
Glu
4.584GluAla: 4.584 ± 0.726
0.367GluCys: 0.367 ± 0.228
3.85GluAsp: 3.85 ± 0.735
7.059GluGlu: 7.059 ± 1.7
2.842GluPhe: 2.842 ± 0.628
3.209GluGly: 3.209 ± 0.443
1.008GluHis: 1.008 ± 0.262
5.226GluIle: 5.226 ± 1.207
6.601GluLys: 6.601 ± 1.205
7.701GluLeu: 7.701 ± 1.255
2.2GluMet: 2.2 ± 0.652
4.859GluAsn: 4.859 ± 0.941
1.558GluPro: 1.558 ± 0.487
3.117GluGln: 3.117 ± 0.715
3.117GluArg: 3.117 ± 0.546
3.209GluSer: 3.209 ± 0.501
4.125GluThr: 4.125 ± 0.878
5.501GluVal: 5.501 ± 0.846
1.008GluTrp: 1.008 ± 0.296
3.759GluTyr: 3.759 ± 0.892
0.0GluXaa: 0.0 ± 0.0
Phe
2.567PheAla: 2.567 ± 0.395
0.183PheCys: 0.183 ± 0.121
3.484PheAsp: 3.484 ± 0.676
3.667PheGlu: 3.667 ± 0.693
1.283PhePhe: 1.283 ± 0.283
3.484PheGly: 3.484 ± 0.697
0.825PheHis: 0.825 ± 0.308
2.659PheIle: 2.659 ± 0.462
3.667PheLys: 3.667 ± 0.731
1.467PheLeu: 1.467 ± 0.472
1.1PheMet: 1.1 ± 0.246
3.85PheAsn: 3.85 ± 0.532
0.825PhePro: 0.825 ± 0.352
1.467PheGln: 1.467 ± 0.339
1.1PheArg: 1.1 ± 0.353
3.3PheSer: 3.3 ± 0.683
2.017PheThr: 2.017 ± 0.339
1.742PheVal: 1.742 ± 0.418
0.183PheTrp: 0.183 ± 0.12
1.925PheTyr: 1.925 ± 0.49
0.0PheXaa: 0.0 ± 0.0
Gly
3.85GlyAla: 3.85 ± 1.276
0.367GlyCys: 0.367 ± 0.163
3.025GlyAsp: 3.025 ± 0.57
4.034GlyGlu: 4.034 ± 0.728
3.209GlyPhe: 3.209 ± 0.661
3.117GlyGly: 3.117 ± 0.553
1.742GlyHis: 1.742 ± 0.579
6.601GlyIle: 6.601 ± 1.583
4.309GlyLys: 4.309 ± 0.513
5.409GlyLeu: 5.409 ± 1.174
2.017GlyMet: 2.017 ± 0.681
3.3GlyAsn: 3.3 ± 0.652
0.367GlyPro: 0.367 ± 0.16
2.384GlyGln: 2.384 ± 0.428
2.292GlyArg: 2.292 ± 0.488
3.667GlySer: 3.667 ± 0.839
4.95GlyThr: 4.95 ± 1.244
2.567GlyVal: 2.567 ± 0.422
0.367GlyTrp: 0.367 ± 0.159
2.475GlyTyr: 2.475 ± 0.536
0.0GlyXaa: 0.0 ± 0.0
His
0.275HisAla: 0.275 ± 0.157
0.092HisCys: 0.092 ± 0.084
1.008HisAsp: 1.008 ± 0.336
1.1HisGlu: 1.1 ± 0.354
0.642HisPhe: 0.642 ± 0.231
0.642HisGly: 0.642 ± 0.239
0.367HisHis: 0.367 ± 0.207
0.733HisIle: 0.733 ± 0.264
1.192HisLys: 1.192 ± 0.319
1.283HisLeu: 1.283 ± 0.295
0.092HisMet: 0.092 ± 0.077
0.458HisAsn: 0.458 ± 0.232
0.825HisPro: 0.825 ± 0.24
0.642HisGln: 0.642 ± 0.223
0.825HisArg: 0.825 ± 0.212
0.733HisSer: 0.733 ± 0.323
0.642HisThr: 0.642 ± 0.244
1.008HisVal: 1.008 ± 0.281
0.275HisTrp: 0.275 ± 0.151
0.55HisTyr: 0.55 ± 0.217
0.0HisXaa: 0.0 ± 0.0
Ile
7.059IleAla: 7.059 ± 1.329
0.458IleCys: 0.458 ± 0.242
5.134IleAsp: 5.134 ± 0.739
6.142IleGlu: 6.142 ± 1.271
1.375IlePhe: 1.375 ± 0.34
5.959IleGly: 5.959 ± 1.191
1.008IleHis: 1.008 ± 0.301
4.034IleIle: 4.034 ± 0.831
7.242IleLys: 7.242 ± 0.871
4.675IleLeu: 4.675 ± 0.762
1.192IleMet: 1.192 ± 0.4
4.95IleAsn: 4.95 ± 0.671
2.109IlePro: 2.109 ± 0.407
2.384IleGln: 2.384 ± 0.374
2.75IleArg: 2.75 ± 0.624
6.142IleSer: 6.142 ± 1.312
4.492IleThr: 4.492 ± 0.807
3.392IleVal: 3.392 ± 0.517
0.458IleTrp: 0.458 ± 0.24
2.017IleTyr: 2.017 ± 0.462
0.0IleXaa: 0.0 ± 0.0
Lys
6.967LysAla: 6.967 ± 0.761
0.642LysCys: 0.642 ± 0.273
3.942LysAsp: 3.942 ± 0.598
6.417LysGlu: 6.417 ± 1.268
2.842LysPhe: 2.842 ± 0.562
4.492LysGly: 4.492 ± 0.479
1.192LysHis: 1.192 ± 0.351
5.134LysIle: 5.134 ± 0.482
6.417LysLys: 6.417 ± 1.219
6.692LysLeu: 6.692 ± 1.071
2.659LysMet: 2.659 ± 0.487
3.392LysAsn: 3.392 ± 0.582
2.934LysPro: 2.934 ± 0.575
4.309LysGln: 4.309 ± 0.764
4.767LysArg: 4.767 ± 0.952
5.592LysSer: 5.592 ± 0.788
5.684LysThr: 5.684 ± 0.672
4.859LysVal: 4.859 ± 0.609
1.008LysTrp: 1.008 ± 0.313
3.575LysTyr: 3.575 ± 0.667
0.0LysXaa: 0.0 ± 0.0
Leu
6.326LeuAla: 6.326 ± 0.866
0.183LeuCys: 0.183 ± 0.129
5.317LeuAsp: 5.317 ± 0.714
7.151LeuGlu: 7.151 ± 1.226
2.75LeuPhe: 2.75 ± 0.477
5.134LeuGly: 5.134 ± 0.78
1.008LeuHis: 1.008 ± 0.367
3.575LeuIle: 3.575 ± 0.604
7.792LeuLys: 7.792 ± 1.017
4.492LeuLeu: 4.492 ± 0.682
1.834LeuMet: 1.834 ± 0.434
5.226LeuAsn: 5.226 ± 0.61
2.384LeuPro: 2.384 ± 0.554
3.209LeuGln: 3.209 ± 0.505
3.667LeuArg: 3.667 ± 0.705
6.326LeuSer: 6.326 ± 0.932
5.867LeuThr: 5.867 ± 0.798
4.034LeuVal: 4.034 ± 0.67
0.458LeuTrp: 0.458 ± 0.197
2.75LeuTyr: 2.75 ± 0.625
0.0LeuXaa: 0.0 ± 0.0
Met
2.2MetAla: 2.2 ± 0.89
0.092MetCys: 0.092 ± 0.089
1.283MetAsp: 1.283 ± 0.482
1.467MetGlu: 1.467 ± 0.377
1.192MetPhe: 1.192 ± 0.291
1.742MetGly: 1.742 ± 0.404
0.458MetHis: 0.458 ± 0.203
2.384MetIle: 2.384 ± 0.476
3.575MetLys: 3.575 ± 0.622
2.2MetLeu: 2.2 ± 0.52
1.467MetMet: 1.467 ± 0.479
1.558MetAsn: 1.558 ± 0.361
0.458MetPro: 0.458 ± 0.186
1.65MetGln: 1.65 ± 0.504
1.467MetArg: 1.467 ± 0.446
2.017MetSer: 2.017 ± 0.443
2.2MetThr: 2.2 ± 0.507
1.834MetVal: 1.834 ± 0.633
0.275MetTrp: 0.275 ± 0.148
0.825MetTyr: 0.825 ± 0.279
0.0MetXaa: 0.0 ± 0.0
Asn
3.85AsnAla: 3.85 ± 0.503
0.367AsnCys: 0.367 ± 0.156
3.759AsnAsp: 3.759 ± 0.686
3.85AsnGlu: 3.85 ± 0.654
2.75AsnPhe: 2.75 ± 0.626
3.85AsnGly: 3.85 ± 0.593
0.642AsnHis: 0.642 ± 0.222
3.392AsnIle: 3.392 ± 0.601
5.042AsnLys: 5.042 ± 0.8
4.309AsnLeu: 4.309 ± 0.637
1.375AsnMet: 1.375 ± 0.337
2.292AsnAsn: 2.292 ± 0.42
2.017AsnPro: 2.017 ± 0.38
2.659AsnGln: 2.659 ± 0.516
1.742AsnArg: 1.742 ± 0.493
4.309AsnSer: 4.309 ± 0.63
3.117AsnThr: 3.117 ± 0.546
3.3AsnVal: 3.3 ± 0.674
1.283AsnTrp: 1.283 ± 0.364
2.842AsnTyr: 2.842 ± 0.508
0.0AsnXaa: 0.0 ± 0.0
Pro
1.283ProAla: 1.283 ± 0.369
0.092ProCys: 0.092 ± 0.092
1.283ProAsp: 1.283 ± 0.304
2.2ProGlu: 2.2 ± 0.538
1.283ProPhe: 1.283 ± 0.351
0.825ProGly: 0.825 ± 0.29
0.367ProHis: 0.367 ± 0.173
2.475ProIle: 2.475 ± 0.405
2.292ProLys: 2.292 ± 0.509
2.017ProLeu: 2.017 ± 0.555
0.183ProMet: 0.183 ± 0.129
1.192ProAsn: 1.192 ± 0.386
0.733ProPro: 0.733 ± 0.207
1.558ProGln: 1.558 ± 0.348
1.1ProArg: 1.1 ± 0.403
1.467ProSer: 1.467 ± 0.395
1.834ProThr: 1.834 ± 0.429
1.742ProVal: 1.742 ± 0.458
0.183ProTrp: 0.183 ± 0.13
2.017ProTyr: 2.017 ± 0.537
0.0ProXaa: 0.0 ± 0.0
Gln
5.776GlnAla: 5.776 ± 0.999
0.092GlnCys: 0.092 ± 0.097
1.558GlnAsp: 1.558 ± 0.353
2.659GlnGlu: 2.659 ± 0.679
2.109GlnPhe: 2.109 ± 0.361
2.659GlnGly: 2.659 ± 0.674
0.183GlnHis: 0.183 ± 0.127
3.942GlnIle: 3.942 ± 0.822
3.759GlnLys: 3.759 ± 0.724
3.942GlnLeu: 3.942 ± 0.823
2.292GlnMet: 2.292 ± 0.465
1.283GlnAsn: 1.283 ± 0.334
0.367GlnPro: 0.367 ± 0.189
3.025GlnGln: 3.025 ± 0.661
1.283GlnArg: 1.283 ± 0.348
3.392GlnSer: 3.392 ± 0.588
1.925GlnThr: 1.925 ± 0.355
2.384GlnVal: 2.384 ± 0.506
0.825GlnTrp: 0.825 ± 0.234
0.917GlnTyr: 0.917 ± 0.351
0.0GlnXaa: 0.0 ± 0.0
Arg
2.842ArgAla: 2.842 ± 0.494
0.275ArgCys: 0.275 ± 0.16
3.117ArgAsp: 3.117 ± 0.749
4.217ArgGlu: 4.217 ± 0.952
0.917ArgPhe: 0.917 ± 0.308
1.834ArgGly: 1.834 ± 0.589
0.642ArgHis: 0.642 ± 0.213
2.292ArgIle: 2.292 ± 0.4
3.392ArgLys: 3.392 ± 0.727
3.942ArgLeu: 3.942 ± 0.702
1.558ArgMet: 1.558 ± 0.458
2.475ArgAsn: 2.475 ± 0.489
1.1ArgPro: 1.1 ± 0.466
1.283ArgGln: 1.283 ± 0.286
1.283ArgArg: 1.283 ± 0.37
1.925ArgSer: 1.925 ± 0.354
2.109ArgThr: 2.109 ± 0.438
2.659ArgVal: 2.659 ± 0.678
0.458ArgTrp: 0.458 ± 0.217
2.2ArgTyr: 2.2 ± 0.508
0.0ArgXaa: 0.0 ± 0.0
Ser
6.509SerAla: 6.509 ± 2.742
0.458SerCys: 0.458 ± 0.219
3.117SerAsp: 3.117 ± 0.653
3.942SerGlu: 3.942 ± 0.695
2.934SerPhe: 2.934 ± 0.461
5.042SerGly: 5.042 ± 0.837
0.642SerHis: 0.642 ± 0.245
5.226SerIle: 5.226 ± 0.673
4.584SerLys: 4.584 ± 0.752
6.876SerLeu: 6.876 ± 0.922
1.925SerMet: 1.925 ± 0.407
3.759SerAsn: 3.759 ± 0.673
2.017SerPro: 2.017 ± 0.355
4.034SerGln: 4.034 ± 0.759
2.475SerArg: 2.475 ± 0.592
4.675SerSer: 4.675 ± 0.725
3.575SerThr: 3.575 ± 0.713
4.492SerVal: 4.492 ± 0.692
0.458SerTrp: 0.458 ± 0.192
1.375SerTyr: 1.375 ± 0.35
0.0SerXaa: 0.0 ± 0.0
Thr
4.95ThrAla: 4.95 ± 1.3
0.0ThrCys: 0.0 ± 0.0
3.759ThrAsp: 3.759 ± 0.716
3.667ThrGlu: 3.667 ± 0.621
3.117ThrPhe: 3.117 ± 0.484
2.934ThrGly: 2.934 ± 0.764
0.55ThrHis: 0.55 ± 0.245
5.134ThrIle: 5.134 ± 0.651
4.584ThrLys: 4.584 ± 0.705
4.584ThrLeu: 4.584 ± 0.714
1.742ThrMet: 1.742 ± 0.5
3.392ThrAsn: 3.392 ± 0.565
2.2ThrPro: 2.2 ± 0.46
3.209ThrGln: 3.209 ± 0.808
2.292ThrArg: 2.292 ± 0.546
4.309ThrSer: 4.309 ± 1.066
4.309ThrThr: 4.309 ± 0.685
4.217ThrVal: 4.217 ± 0.508
0.367ThrTrp: 0.367 ± 0.139
2.384ThrTyr: 2.384 ± 0.53
0.0ThrXaa: 0.0 ± 0.0
Val
5.042ValAla: 5.042 ± 0.655
0.275ValCys: 0.275 ± 0.188
2.384ValAsp: 2.384 ± 0.463
4.034ValGlu: 4.034 ± 0.867
2.842ValPhe: 2.842 ± 0.482
3.209ValGly: 3.209 ± 0.797
0.183ValHis: 0.183 ± 0.117
4.584ValIle: 4.584 ± 0.627
4.309ValLys: 4.309 ± 0.664
3.025ValLeu: 3.025 ± 0.526
1.375ValMet: 1.375 ± 0.439
3.85ValAsn: 3.85 ± 0.692
1.65ValPro: 1.65 ± 0.32
2.017ValGln: 2.017 ± 0.652
1.742ValArg: 1.742 ± 0.352
4.584ValSer: 4.584 ± 0.749
4.309ValThr: 4.309 ± 0.455
3.484ValVal: 3.484 ± 0.747
0.917ValTrp: 0.917 ± 0.354
2.75ValTyr: 2.75 ± 0.569
0.0ValXaa: 0.0 ± 0.0
Trp
0.55TrpAla: 0.55 ± 0.217
0.0TrpCys: 0.0 ± 0.0
0.917TrpAsp: 0.917 ± 0.305
0.642TrpGlu: 0.642 ± 0.239
0.183TrpPhe: 0.183 ± 0.116
0.183TrpGly: 0.183 ± 0.117
0.275TrpHis: 0.275 ± 0.154
0.642TrpIle: 0.642 ± 0.231
0.642TrpLys: 0.642 ± 0.219
0.917TrpLeu: 0.917 ± 0.288
0.367TrpMet: 0.367 ± 0.185
0.55TrpAsn: 0.55 ± 0.167
0.275TrpPro: 0.275 ± 0.17
0.55TrpGln: 0.55 ± 0.189
0.917TrpArg: 0.917 ± 0.326
0.55TrpSer: 0.55 ± 0.221
0.825TrpThr: 0.825 ± 0.335
0.55TrpVal: 0.55 ± 0.318
0.183TrpTrp: 0.183 ± 0.096
0.642TrpTyr: 0.642 ± 0.252
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.925TyrAla: 1.925 ± 0.497
0.183TyrCys: 0.183 ± 0.128
2.842TyrAsp: 2.842 ± 0.692
2.842TyrGlu: 2.842 ± 0.614
2.017TyrPhe: 2.017 ± 0.513
2.109TyrGly: 2.109 ± 0.448
0.458TyrHis: 0.458 ± 0.217
3.025TyrIle: 3.025 ± 0.576
2.475TyrLys: 2.475 ± 0.67
4.584TyrLeu: 4.584 ± 0.828
1.283TyrMet: 1.283 ± 0.353
1.925TyrAsn: 1.925 ± 0.387
0.733TyrPro: 0.733 ± 0.254
1.467TyrGln: 1.467 ± 0.499
2.017TyrArg: 2.017 ± 0.513
3.759TyrSer: 3.759 ± 0.567
2.017TyrThr: 2.017 ± 0.503
1.375TyrVal: 1.375 ± 0.407
0.458TyrTrp: 0.458 ± 0.179
2.75TyrTyr: 2.75 ± 0.681
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 48 proteins (10909 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski