Amino acid dipeptide frequency for Streptococcus phage phiST1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and underrepresented ones are marked in blue in the table.
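The comparison against the uniform expectation can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the page does not say how Proteome-pI counts or normalizes dipeptides, so the sketch assumes overlapping pairs counted within each protein, pools every non-standard letter as X (Xaa), and uses hypothetical names (`dipeptide_permille`, `representation`) and made-up toy sequences.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # one-letter codes of the 20 standard amino acids
EXPECTED_PERMILLE = 1000.0 / 400        # 2.5 per mille if all 400 dipeptides were equally likely


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard letters is pooled as 'X' (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Pairs are counted within each protein only, never across protein boundaries.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def representation(freq_permille):
    """Label a frequency relative to the uniform expectation of 2.5 per mille."""
    if freq_permille > EXPECTED_PERMILLE:
        return "over"
    if freq_permille < EXPECTED_PERMILLE:
        return "under"
    return "as expected"


toy_proteins = ["MKKLA", "AKK"]  # made-up sequences, not from the phage proteome
freqs = dipeptide_permille(toy_proteins)
for dp, f in sorted(freqs.items()):
    print(f"{dp}: {f:.1f} per mille ({representation(f)})")
```

Applied to values from the table, `representation(7.286)` (LysLys) gives "over" and `representation(0.164)` (AlaCys) gives "under", which corresponds to the red and blue marking described above.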

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions (for example, 5.158‰ = 0.005158, or 0.5158%).

For more information, see the "sequence space" article on Wikipedia.

Dipeptide: frequency ± error (‰), grouped by first amino acid. Second amino acids follow the order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 5.158 ± 1.235
AlaCys: 0.164 ± 0.123
AlaAsp: 5.567 ± 1.051
AlaGlu: 7.041 ± 1.089
AlaPhe: 2.374 ± 0.7
AlaGly: 6.14 ± 0.958
AlaHis: 0.327 ± 0.152
AlaIle: 5.976 ± 1.136
AlaLys: 7.041 ± 1.33
AlaLeu: 6.304 ± 1.188
AlaMet: 2.456 ± 0.865
AlaAsn: 4.912 ± 0.665
AlaPro: 2.292 ± 0.351
AlaGln: 3.275 ± 0.616
AlaArg: 2.702 ± 0.416
AlaSer: 3.438 ± 0.801
AlaThr: 4.257 ± 0.518
AlaVal: 5.239 ± 1.091
AlaTrp: 1.064 ± 0.586
AlaTyr: 2.456 ± 0.697
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.327 ± 0.161
CysCys: 0.0 ± 0.0
CysAsp: 0.246 ± 0.134
CysGlu: 0.573 ± 0.18
CysPhe: 0.327 ± 0.16
CysGly: 0.164 ± 0.103
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 0.327 ± 0.173
CysLeu: 0.246 ± 0.154
CysMet: 0.082 ± 0.091
CysAsn: 0.082 ± 0.069
CysPro: 0.0 ± 0.0
CysGln: 0.327 ± 0.14
CysArg: 0.082 ± 0.064
CysSer: 0.327 ± 0.161
CysThr: 0.164 ± 0.11
CysVal: 0.409 ± 0.198
CysTrp: 0.0 ± 0.0
CysTyr: 0.246 ± 0.147
CysXaa: 0.0 ± 0.0

Asp
AspAla: 6.467 ± 1.971
AspCys: 0.164 ± 0.109
AspAsp: 5.239 ± 0.954
AspGlu: 4.093 ± 0.742
AspPhe: 3.275 ± 0.482
AspGly: 6.467 ± 1.078
AspHis: 0.819 ± 0.237
AspIle: 4.585 ± 0.778
AspLys: 4.83 ± 0.648
AspLeu: 4.994 ± 0.764
AspMet: 1.637 ± 0.354
AspAsn: 4.011 ± 0.532
AspPro: 1.474 ± 0.379
AspGln: 1.392 ± 0.568
AspArg: 2.374 ± 0.46
AspSer: 4.912 ± 0.896
AspThr: 4.503 ± 0.629
AspVal: 3.275 ± 0.605
AspTrp: 0.327 ± 0.139
AspTyr: 2.62 ± 0.643
AspXaa: 0.0 ± 0.0

Glu
GluAla: 3.684 ± 0.621
GluCys: 0.246 ± 0.128
GluAsp: 2.783 ± 0.479
GluGlu: 4.666 ± 0.664
GluPhe: 3.111 ± 0.504
GluGly: 1.883 ± 0.373
GluHis: 1.228 ± 0.361
GluIle: 5.321 ± 0.757
GluLys: 5.485 ± 0.69
GluLeu: 6.795 ± 0.99
GluMet: 1.801 ± 0.399
GluAsn: 4.257 ± 0.662
GluPro: 1.31 ± 0.386
GluGln: 3.438 ± 0.562
GluArg: 3.93 ± 0.7
GluSer: 2.538 ± 0.445
GluThr: 3.52 ± 0.713
GluVal: 5.158 ± 1.08
GluTrp: 0.573 ± 0.192
GluTyr: 2.702 ± 0.562
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.62 ± 0.332
PheCys: 0.082 ± 0.079
PheAsp: 3.438 ± 0.576
PheGlu: 3.029 ± 0.557
PhePhe: 0.819 ± 0.225
PheGly: 2.047 ± 0.42
PheHis: 0.573 ± 0.267
PheIle: 2.374 ± 0.304
PheLys: 2.865 ± 0.438
PheLeu: 3.029 ± 0.54
PheMet: 0.655 ± 0.214
PheAsn: 2.865 ± 0.549
PhePro: 0.573 ± 0.224
PheGln: 1.801 ± 0.455
PheArg: 0.737 ± 0.22
PheSer: 3.357 ± 0.746
PheThr: 2.456 ± 0.515
PheVal: 2.538 ± 0.41
PheTrp: 0.409 ± 0.225
PheTyr: 1.146 ± 0.29
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 5.731 ± 1.054
GlyCys: 0.327 ± 0.18
GlyAsp: 4.994 ± 1.728
GlyGlu: 3.275 ± 0.43
GlyPhe: 2.702 ± 0.495
GlyGly: 3.766 ± 0.537
GlyHis: 1.064 ± 0.307
GlyIle: 6.795 ± 1.153
GlyLys: 6.058 ± 0.665
GlyLeu: 4.666 ± 0.834
GlyMet: 2.374 ± 0.322
GlyAsn: 4.666 ± 1.058
GlyPro: 0.573 ± 0.427
GlyGln: 3.111 ± 0.474
GlyArg: 3.111 ± 0.553
GlySer: 4.093 ± 0.783
GlyThr: 4.994 ± 0.903
GlyVal: 6.058 ± 0.774
GlyTrp: 0.655 ± 0.183
GlyTyr: 2.702 ± 0.417
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.819 ± 0.223
HisCys: 0.082 ± 0.088
HisAsp: 0.737 ± 0.256
HisGlu: 1.064 ± 0.29
HisPhe: 0.491 ± 0.199
HisGly: 1.146 ± 0.308
HisHis: 0.409 ± 0.175
HisIle: 1.392 ± 0.406
HisLys: 1.064 ± 0.427
HisLeu: 1.228 ± 0.313
HisMet: 0.0 ± 0.0
HisAsn: 0.655 ± 0.203
HisPro: 0.655 ± 0.293
HisGln: 0.573 ± 0.201
HisArg: 0.982 ± 0.327
HisSer: 0.655 ± 0.263
HisThr: 0.901 ± 0.275
HisVal: 0.737 ± 0.247
HisTrp: 0.409 ± 0.204
HisTyr: 0.819 ± 0.259
HisXaa: 0.0 ± 0.0

Ile
IleAla: 6.222 ± 0.959
IleCys: 0.246 ± 0.112
IleAsp: 5.485 ± 0.668
IleGlu: 5.813 ± 0.628
IlePhe: 1.392 ± 0.381
IleGly: 5.076 ± 0.906
IleHis: 1.474 ± 0.384
IleIle: 3.684 ± 0.863
IleLys: 7.041 ± 0.911
IleLeu: 3.684 ± 0.558
IleMet: 0.982 ± 0.277
IleAsn: 3.111 ± 0.549
IlePro: 2.292 ± 0.461
IleGln: 2.374 ± 0.459
IleArg: 3.193 ± 0.467
IleSer: 4.093 ± 0.703
IleThr: 5.485 ± 0.762
IleVal: 3.93 ± 0.572
IleTrp: 0.655 ± 0.226
IleTyr: 2.538 ± 0.471
IleXaa: 0.0 ± 0.0

Lys
LysAla: 5.731 ± 0.822
LysCys: 0.246 ± 0.137
LysAsp: 4.994 ± 0.764
LysGlu: 5.485 ± 0.679
LysPhe: 2.21 ± 0.413
LysGly: 5.649 ± 1.077
LysHis: 1.555 ± 0.427
LysIle: 5.158 ± 0.836
LysLys: 7.286 ± 1.137
LysLeu: 5.894 ± 0.877
LysMet: 2.129 ± 0.537
LysAsn: 3.52 ± 0.442
LysPro: 2.456 ± 0.499
LysGln: 3.029 ± 0.63
LysArg: 3.602 ± 0.6
LysSer: 4.83 ± 0.663
LysThr: 5.321 ± 0.722
LysVal: 5.976 ± 1.535
LysTrp: 0.655 ± 0.274
LysTyr: 2.947 ± 0.653
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 6.467 ± 0.633
LeuCys: 0.409 ± 0.203
LeuAsp: 6.959 ± 0.996
LeuGlu: 4.994 ± 0.908
LeuPhe: 2.21 ± 0.41
LeuGly: 4.994 ± 1.037
LeuHis: 1.228 ± 0.339
LeuIle: 4.585 ± 0.648
LeuLys: 7.041 ± 0.763
LeuLeu: 4.83 ± 0.739
LeuMet: 0.819 ± 0.263
LeuAsn: 5.239 ± 0.683
LeuPro: 2.374 ± 0.53
LeuGln: 3.111 ± 0.521
LeuArg: 2.456 ± 0.524
LeuSer: 7.122 ± 0.846
LeuThr: 4.257 ± 0.656
LeuVal: 4.339 ± 0.592
LeuTrp: 0.655 ± 0.255
LeuTyr: 2.374 ± 0.475
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.129 ± 0.578
MetCys: 0.082 ± 0.084
MetAsp: 1.474 ± 0.34
MetGlu: 1.146 ± 0.322
MetPhe: 0.737 ± 0.208
MetGly: 1.31 ± 0.322
MetHis: 0.246 ± 0.125
MetIle: 1.555 ± 0.409
MetLys: 1.965 ± 0.444
MetLeu: 1.801 ± 0.369
MetMet: 1.064 ± 0.31
MetAsn: 1.064 ± 0.275
MetPro: 1.392 ± 0.813
MetGln: 1.719 ± 0.45
MetArg: 1.228 ± 0.305
MetSer: 1.228 ± 0.462
MetThr: 2.21 ± 0.3
MetVal: 1.555 ± 0.5
MetTrp: 0.327 ± 0.157
MetTyr: 1.064 ± 0.401
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 5.485 ± 0.884
AsnCys: 0.164 ± 0.118
AsnAsp: 3.684 ± 0.611
AsnGlu: 2.865 ± 0.437
AsnPhe: 1.637 ± 0.334
AsnGly: 4.257 ± 0.728
AsnHis: 0.655 ± 0.213
AsnIle: 3.029 ± 0.488
AsnLys: 3.684 ± 0.584
AsnLeu: 4.666 ± 0.507
AsnMet: 1.392 ± 0.383
AsnAsn: 3.111 ± 0.541
AsnPro: 2.538 ± 0.469
AsnGln: 2.21 ± 0.714
AsnArg: 2.21 ± 0.532
AsnSer: 3.111 ± 0.504
AsnThr: 3.357 ± 0.801
AsnVal: 3.93 ± 0.648
AsnTrp: 0.655 ± 0.216
AsnTyr: 1.883 ± 0.389
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 2.047 ± 0.381
ProCys: 0.164 ± 0.137
ProAsp: 1.965 ± 0.548
ProGlu: 1.228 ± 0.25
ProPhe: 1.146 ± 0.267
ProGly: 1.719 ± 0.385
ProHis: 0.409 ± 0.174
ProIle: 1.801 ± 0.322
ProLys: 2.21 ± 0.631
ProLeu: 2.21 ± 0.488
ProMet: 0.819 ± 0.307
ProAsn: 1.965 ± 0.415
ProPro: 0.573 ± 0.226
ProGln: 0.901 ± 0.261
ProArg: 1.064 ± 0.317
ProSer: 2.047 ± 0.422
ProThr: 2.21 ± 0.456
ProVal: 1.965 ± 0.428
ProTrp: 0.164 ± 0.087
ProTyr: 1.637 ± 0.396
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 4.421 ± 0.817
GlnCys: 0.246 ± 0.145
GlnAsp: 1.801 ± 0.399
GlnGlu: 2.702 ± 0.524
GlnPhe: 0.982 ± 0.321
GlnGly: 4.585 ± 1.046
GlnHis: 0.655 ± 0.238
GlnIle: 2.292 ± 0.388
GlnLys: 1.965 ± 0.415
GlnLeu: 2.702 ± 0.526
GlnMet: 1.474 ± 0.366
GlnAsn: 1.965 ± 0.377
GlnPro: 0.901 ± 0.317
GlnGln: 2.456 ± 0.677
GlnArg: 1.719 ± 0.328
GlnSer: 3.029 ± 0.712
GlnThr: 2.21 ± 0.501
GlnVal: 2.374 ± 0.376
GlnTrp: 0.246 ± 0.103
GlnTyr: 1.637 ± 0.425
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 3.275 ± 0.615
ArgCys: 0.164 ± 0.126
ArgAsp: 2.21 ± 0.326
ArgGlu: 2.129 ± 0.463
ArgPhe: 1.801 ± 0.434
ArgGly: 2.129 ± 0.39
ArgHis: 0.655 ± 0.197
ArgIle: 3.111 ± 0.563
ArgLys: 3.193 ± 0.62
ArgLeu: 4.011 ± 0.557
ArgMet: 1.064 ± 0.274
ArgAsn: 1.801 ± 0.344
ArgPro: 1.474 ± 0.35
ArgGln: 1.474 ± 0.384
ArgArg: 1.719 ± 0.441
ArgSer: 1.719 ± 0.386
ArgThr: 2.783 ± 0.591
ArgVal: 2.21 ± 0.474
ArgTrp: 1.146 ± 0.419
ArgTyr: 1.965 ± 0.367
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 4.83 ± 0.864
SerCys: 0.327 ± 0.147
SerAsp: 4.585 ± 0.931
SerGlu: 3.275 ± 0.541
SerPhe: 3.029 ± 0.781
SerGly: 5.485 ± 1.087
SerHis: 0.491 ± 0.17
SerIle: 3.438 ± 0.533
SerLys: 3.275 ± 0.512
SerLeu: 5.485 ± 0.849
SerMet: 1.637 ± 0.54
SerAsn: 3.602 ± 0.501
SerPro: 1.719 ± 0.403
SerGln: 2.047 ± 0.405
SerArg: 1.555 ± 0.406
SerSer: 4.503 ± 0.85
SerThr: 4.83 ± 0.841
SerVal: 4.011 ± 0.572
SerTrp: 0.819 ± 0.307
SerTyr: 3.029 ± 0.518
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 4.503 ± 0.635
ThrCys: 0.164 ± 0.1
ThrAsp: 4.421 ± 0.909
ThrGlu: 3.52 ± 0.65
ThrPhe: 3.766 ± 0.529
ThrGly: 5.731 ± 0.732
ThrHis: 1.228 ± 0.368
ThrIle: 5.403 ± 0.94
ThrLys: 5.485 ± 0.649
ThrLeu: 5.976 ± 0.783
ThrMet: 1.555 ± 0.362
ThrAsn: 2.702 ± 0.571
ThrPro: 2.62 ± 0.479
ThrGln: 2.702 ± 0.618
ThrArg: 1.965 ± 0.361
ThrSer: 2.538 ± 0.641
ThrThr: 4.83 ± 0.571
ThrVal: 5.567 ± 0.623
ThrTrp: 1.228 ± 0.43
ThrTyr: 3.111 ± 0.641
ThrXaa: 0.0 ± 0.0

Val
ValAla: 5.239 ± 0.746
ValCys: 0.409 ± 0.169
ValAsp: 4.011 ± 0.529
ValGlu: 5.158 ± 0.675
ValPhe: 3.111 ± 0.413
ValGly: 4.175 ± 0.579
ValHis: 0.901 ± 0.293
ValIle: 4.503 ± 0.563
ValLys: 4.011 ± 0.657
ValLeu: 4.011 ± 0.543
ValMet: 1.801 ± 0.331
ValAsn: 2.374 ± 0.403
ValPro: 1.883 ± 0.347
ValGln: 2.21 ± 0.571
ValArg: 1.883 ± 0.433
ValSer: 5.485 ± 0.854
ValThr: 6.222 ± 0.766
ValVal: 3.766 ± 0.833
ValTrp: 1.31 ± 0.764
ValTyr: 3.602 ± 1.264
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.901 ± 0.331
TrpCys: 0.0 ± 0.0
TrpAsp: 0.573 ± 0.243
TrpGlu: 0.491 ± 0.189
TrpPhe: 0.491 ± 0.22
TrpGly: 1.392 ± 0.751
TrpHis: 0.164 ± 0.123
TrpIle: 0.655 ± 0.303
TrpLys: 0.737 ± 0.253
TrpLeu: 0.409 ± 0.185
TrpMet: 0.082 ± 0.084
TrpAsn: 0.409 ± 0.146
TrpPro: 0.0 ± 0.0
TrpGln: 0.246 ± 0.197
TrpArg: 1.146 ± 0.27
TrpSer: 1.064 ± 0.356
TrpThr: 0.901 ± 0.417
TrpVal: 1.31 ± 0.552
TrpTrp: 0.327 ± 0.187
TrpTyr: 0.491 ± 0.217
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.129 ± 0.446
TyrCys: 0.246 ± 0.165
TyrAsp: 2.292 ± 0.382
TyrGlu: 1.883 ± 0.522
TyrPhe: 1.801 ± 0.473
TyrGly: 3.602 ± 1.444
TyrHis: 0.737 ± 0.299
TyrIle: 3.111 ± 0.652
TyrLys: 3.111 ± 0.461
TyrLeu: 3.52 ± 0.616
TyrMet: 1.392 ± 0.435
TyrAsn: 2.21 ± 0.484
TyrPro: 1.146 ± 0.249
TyrGln: 1.965 ± 0.289
TyrArg: 2.374 ± 0.441
TyrSer: 1.883 ± 0.39
TyrThr: 3.602 ± 0.804
TyrVal: 1.801 ± 0.326
TyrTrp: 0.164 ± 0.126
TyrTyr: 1.883 ± 0.439
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 56 proteins (12216 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
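A protein-level bootstrap of this kind might look like the sketch below. It is not the database's own code: the page does not state which spread statistic the ± value reports, so the standard deviation across replicates is an assumption, as are the function names (`dipeptide_permille`, `bootstrap_error`), the fixed seed, and the made-up toy sequences.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins):
    """Per-mille frequency of each overlapping dipeptide in a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {dp: 1000.0 * n / total for dp, n in counts.items()} if total else {}


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Standard deviation of a dipeptide frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size fixed.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample).get(dipeptide, 0.0))
    return statistics.stdev(estimates)


toy_proteins = ["MKKLA", "AKK", "LLKA", "MKLA"]  # made-up sequences
err_kk = bootstrap_error(toy_proteins, "KK")
print(f"KK: {dipeptide_permille(toy_proteins)['KK']:.1f} ± {err_kk:.1f} per mille")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins. A dipeptide that never occurs, such as the CysCys entry in the table, gets an error of exactly 0.0 under this scheme.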

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski