Amino acid dipepetide frequency for Bacillus phage Hyb2phi3Ts-SPbeta

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.583AlaAla: 3.583 ± 0.368
0.512AlaCys: 0.512 ± 0.145
3.421AlaAsp: 3.421 ± 0.334
3.125AlaGlu: 3.125 ± 0.33
2.263AlaPhe: 2.263 ± 0.293
2.613AlaGly: 2.613 ± 0.309
0.862AlaHis: 0.862 ± 0.173
4.66AlaIle: 4.66 ± 0.445
5.01AlaLys: 5.01 ± 0.567
4.175AlaLeu: 4.175 ± 0.474
1.535AlaMet: 1.535 ± 0.238
2.263AlaAsn: 2.263 ± 0.283
1.347AlaPro: 1.347 ± 0.197
1.508AlaGln: 1.508 ± 0.32
1.724AlaArg: 1.724 ± 0.227
3.394AlaSer: 3.394 ± 0.419
3.017AlaThr: 3.017 ± 0.281
3.071AlaVal: 3.071 ± 0.293
0.485AlaTrp: 0.485 ± 0.114
2.721AlaTyr: 2.721 ± 0.28
0.0AlaXaa: 0.0 ± 0.0
Cys
0.35CysAla: 0.35 ± 0.097
0.189CysCys: 0.189 ± 0.066
0.35CysAsp: 0.35 ± 0.109
0.727CysGlu: 0.727 ± 0.154
0.781CysPhe: 0.781 ± 0.142
0.808CysGly: 0.808 ± 0.201
0.269CysHis: 0.269 ± 0.082
0.62CysIle: 0.62 ± 0.115
0.7CysLys: 0.7 ± 0.178
0.781CysLeu: 0.781 ± 0.146
0.162CysMet: 0.162 ± 0.075
0.754CysAsn: 0.754 ± 0.158
0.35CysPro: 0.35 ± 0.118
0.242CysGln: 0.242 ± 0.079
0.458CysArg: 0.458 ± 0.13
0.646CysSer: 0.646 ± 0.138
0.431CysThr: 0.431 ± 0.125
0.566CysVal: 0.566 ± 0.16
0.162CysTrp: 0.162 ± 0.064
0.35CysTyr: 0.35 ± 0.091
0.0CysXaa: 0.0 ± 0.0
Asp
2.963AspAla: 2.963 ± 0.338
0.62AspCys: 0.62 ± 0.123
3.825AspAsp: 3.825 ± 0.417
5.387AspGlu: 5.387 ± 0.389
3.206AspPhe: 3.206 ± 0.246
3.933AspGly: 3.933 ± 0.355
0.943AspHis: 0.943 ± 0.179
5.522AspIle: 5.522 ± 0.377
5.818AspLys: 5.818 ± 0.445
5.63AspLeu: 5.63 ± 0.414
1.347AspMet: 1.347 ± 0.168
3.529AspAsn: 3.529 ± 0.296
1.939AspPro: 1.939 ± 0.185
2.128AspGln: 2.128 ± 0.22
1.913AspArg: 1.913 ± 0.302
3.717AspSer: 3.717 ± 0.346
2.586AspThr: 2.586 ± 0.252
3.771AspVal: 3.771 ± 0.36
0.593AspTrp: 0.593 ± 0.126
3.717AspTyr: 3.717 ± 0.337
0.0AspXaa: 0.0 ± 0.0
Glu
4.229GluAla: 4.229 ± 0.423
0.754GluCys: 0.754 ± 0.154
4.795GluAsp: 4.795 ± 0.302
7.004GluGlu: 7.004 ± 0.434
3.206GluPhe: 3.206 ± 0.303
3.96GluGly: 3.96 ± 0.422
1.724GluHis: 1.724 ± 0.245
7.004GluIle: 7.004 ± 0.391
7.785GluLys: 7.785 ± 0.481
8.378GluLeu: 8.378 ± 0.586
2.451GluMet: 2.451 ± 0.243
4.364GluAsn: 4.364 ± 0.366
1.562GluPro: 1.562 ± 0.243
3.394GluGln: 3.394 ± 0.345
3.556GluArg: 3.556 ± 0.398
3.906GluSer: 3.906 ± 0.298
3.744GluThr: 3.744 ± 0.319
5.414GluVal: 5.414 ± 0.447
1.158GluTrp: 1.158 ± 0.209
3.771GluTyr: 3.771 ± 0.384
0.0GluXaa: 0.0 ± 0.0
Phe
1.778PheAla: 1.778 ± 0.208
0.404PheCys: 0.404 ± 0.121
3.098PheAsp: 3.098 ± 0.252
3.825PheGlu: 3.825 ± 0.395
2.047PhePhe: 2.047 ± 0.279
2.397PheGly: 2.397 ± 0.251
0.97PheHis: 0.97 ± 0.178
3.098PheIle: 3.098 ± 0.309
4.633PheLys: 4.633 ± 0.4
2.963PheLeu: 2.963 ± 0.33
0.808PheMet: 0.808 ± 0.172
3.206PheAsn: 3.206 ± 0.218
1.212PhePro: 1.212 ± 0.166
0.943PheGln: 0.943 ± 0.158
1.724PheArg: 1.724 ± 0.238
3.071PheSer: 3.071 ± 0.269
2.074PheThr: 2.074 ± 0.211
2.37PheVal: 2.37 ± 0.328
0.215PheTrp: 0.215 ± 0.076
2.236PheTyr: 2.236 ± 0.278
0.0PheXaa: 0.0 ± 0.0
Gly
2.424GlyAla: 2.424 ± 0.259
0.539GlyCys: 0.539 ± 0.146
3.232GlyAsp: 3.232 ± 0.359
4.202GlyGlu: 4.202 ± 0.345
3.017GlyPhe: 3.017 ± 0.273
2.694GlyGly: 2.694 ± 0.296
1.158GlyHis: 1.158 ± 0.211
4.041GlyIle: 4.041 ± 0.395
4.903GlyLys: 4.903 ± 0.296
4.606GlyLeu: 4.606 ± 0.33
1.266GlyMet: 1.266 ± 0.218
3.583GlyAsn: 3.583 ± 0.387
0.108GlyPro: 0.108 ± 0.051
1.859GlyGln: 1.859 ± 0.209
1.67GlyArg: 1.67 ± 0.213
3.69GlySer: 3.69 ± 0.342
3.313GlyThr: 3.313 ± 0.326
3.529GlyVal: 3.529 ± 0.319
0.485GlyTrp: 0.485 ± 0.128
2.882GlyTyr: 2.882 ± 0.266
0.0GlyXaa: 0.0 ± 0.0
His
0.646HisAla: 0.646 ± 0.134
0.242HisCys: 0.242 ± 0.087
0.97HisAsp: 0.97 ± 0.137
1.643HisGlu: 1.643 ± 0.222
0.646HisPhe: 0.646 ± 0.121
1.051HisGly: 1.051 ± 0.16
0.431HisHis: 0.431 ± 0.097
1.347HisIle: 1.347 ± 0.227
1.832HisLys: 1.832 ± 0.249
1.616HisLeu: 1.616 ± 0.195
0.458HisMet: 0.458 ± 0.137
1.482HisAsn: 1.482 ± 0.238
0.754HisPro: 0.754 ± 0.132
0.539HisGln: 0.539 ± 0.127
0.889HisArg: 0.889 ± 0.162
1.508HisSer: 1.508 ± 0.236
1.104HisThr: 1.104 ± 0.196
1.024HisVal: 1.024 ± 0.173
0.108HisTrp: 0.108 ± 0.054
0.727HisTyr: 0.727 ± 0.133
0.0HisXaa: 0.0 ± 0.0
Ile
3.879IleAla: 3.879 ± 0.321
0.7IleCys: 0.7 ± 0.152
5.387IleAsp: 5.387 ± 0.344
6.546IleGlu: 6.546 ± 0.445
2.37IlePhe: 2.37 ± 0.228
4.014IleGly: 4.014 ± 0.304
2.074IleHis: 2.074 ± 0.273
4.094IleIle: 4.094 ± 0.369
8.243IleLys: 8.243 ± 0.434
5.037IleLeu: 5.037 ± 0.411
1.266IleMet: 1.266 ± 0.146
5.765IleAsn: 5.765 ± 0.461
2.263IlePro: 2.263 ± 0.228
2.505IleGln: 2.505 ± 0.326
2.748IleArg: 2.748 ± 0.265
5.414IleSer: 5.414 ± 0.553
4.041IleThr: 4.041 ± 0.381
4.445IleVal: 4.445 ± 0.416
0.916IleTrp: 0.916 ± 0.144
2.209IleTyr: 2.209 ± 0.24
0.0IleXaa: 0.0 ± 0.0
Lys
5.603LysAla: 5.603 ± 0.438
1.051LysCys: 1.051 ± 0.224
5.684LysAsp: 5.684 ± 0.403
8.835LysGlu: 8.835 ± 0.59
3.367LysPhe: 3.367 ± 0.33
5.441LysGly: 5.441 ± 0.436
1.616LysHis: 1.616 ± 0.221
6.977LysIle: 6.977 ± 0.408
10.048LysLys: 10.048 ± 0.658
8.351LysLeu: 8.351 ± 0.497
2.344LysMet: 2.344 ± 0.251
6.142LysAsn: 6.142 ± 0.452
2.263LysPro: 2.263 ± 0.3
4.175LysGln: 4.175 ± 0.475
4.552LysArg: 4.552 ± 0.48
5.711LysSer: 5.711 ± 0.489
5.172LysThr: 5.172 ± 0.426
6.061LysVal: 6.061 ± 0.443
0.781LysTrp: 0.781 ± 0.148
4.741LysTyr: 4.741 ± 0.383
0.0LysXaa: 0.0 ± 0.0
Leu
4.148LeuAla: 4.148 ± 0.362
0.727LeuCys: 0.727 ± 0.157
5.387LeuAsp: 5.387 ± 0.38
6.223LeuGlu: 6.223 ± 0.457
3.852LeuPhe: 3.852 ± 0.369
3.61LeuGly: 3.61 ± 0.306
1.535LeuHis: 1.535 ± 0.189
6.249LeuIle: 6.249 ± 0.422
9.051LeuLys: 9.051 ± 0.542
7.327LeuLeu: 7.327 ± 0.47
2.182LeuMet: 2.182 ± 0.251
6.842LeuAsn: 6.842 ± 0.447
2.236LeuPro: 2.236 ± 0.261
3.259LeuGln: 3.259 ± 0.432
3.556LeuArg: 3.556 ± 0.306
6.034LeuSer: 6.034 ± 0.422
5.118LeuThr: 5.118 ± 0.384
4.202LeuVal: 4.202 ± 0.339
0.7LeuTrp: 0.7 ± 0.159
3.206LeuTyr: 3.206 ± 0.322
0.0LeuXaa: 0.0 ± 0.0
Met
1.374MetAla: 1.374 ± 0.18
0.108MetCys: 0.108 ± 0.06
1.455MetAsp: 1.455 ± 0.169
1.832MetGlu: 1.832 ± 0.207
0.943MetPhe: 0.943 ± 0.163
1.185MetGly: 1.185 ± 0.161
0.269MetHis: 0.269 ± 0.083
1.751MetIle: 1.751 ± 0.239
2.721MetLys: 2.721 ± 0.28
1.966MetLeu: 1.966 ± 0.223
0.539MetMet: 0.539 ± 0.135
1.697MetAsn: 1.697 ± 0.218
0.727MetPro: 0.727 ± 0.131
0.862MetGln: 0.862 ± 0.2
0.943MetArg: 0.943 ± 0.144
1.859MetSer: 1.859 ± 0.279
1.266MetThr: 1.266 ± 0.2
1.024MetVal: 1.024 ± 0.148
0.269MetTrp: 0.269 ± 0.087
0.62MetTyr: 0.62 ± 0.131
0.0MetXaa: 0.0 ± 0.0
Asn
3.421AsnAla: 3.421 ± 0.428
0.835AsnCys: 0.835 ± 0.181
3.798AsnAsp: 3.798 ± 0.325
5.657AsnGlu: 5.657 ± 0.447
2.586AsnPhe: 2.586 ± 0.246
3.663AsnGly: 3.663 ± 0.314
1.024AsnHis: 1.024 ± 0.167
4.552AsnIle: 4.552 ± 0.344
7.031AsnLys: 7.031 ± 0.533
5.28AsnLeu: 5.28 ± 0.349
1.401AsnMet: 1.401 ± 0.17
4.175AsnAsn: 4.175 ± 0.339
1.832AsnPro: 1.832 ± 0.21
2.074AsnGln: 2.074 ± 0.302
2.505AsnArg: 2.505 ± 0.209
4.175AsnSer: 4.175 ± 0.389
3.098AsnThr: 3.098 ± 0.415
3.556AsnVal: 3.556 ± 0.286
0.539AsnTrp: 0.539 ± 0.126
2.478AsnTyr: 2.478 ± 0.302
0.0AsnXaa: 0.0 ± 0.0
Pro
1.374ProAla: 1.374 ± 0.161
0.162ProCys: 0.162 ± 0.062
2.236ProAsp: 2.236 ± 0.236
2.02ProGlu: 2.02 ± 0.242
1.104ProPhe: 1.104 ± 0.166
1.024ProGly: 1.024 ± 0.175
1.024ProHis: 1.024 ± 0.184
1.482ProIle: 1.482 ± 0.243
2.559ProLys: 2.559 ± 0.268
2.451ProLeu: 2.451 ± 0.302
0.431ProMet: 0.431 ± 0.099
1.482ProAsn: 1.482 ± 0.193
0.566ProPro: 0.566 ± 0.14
0.7ProGln: 0.7 ± 0.126
0.835ProArg: 0.835 ± 0.166
2.02ProSer: 2.02 ± 0.211
1.482ProThr: 1.482 ± 0.191
1.32ProVal: 1.32 ± 0.208
0.296ProTrp: 0.296 ± 0.099
1.32ProTyr: 1.32 ± 0.175
0.0ProXaa: 0.0 ± 0.0
Gln
2.209GlnAla: 2.209 ± 0.505
0.242GlnCys: 0.242 ± 0.077
1.724GlnAsp: 1.724 ± 0.237
3.125GlnGlu: 3.125 ± 0.388
1.347GlnPhe: 1.347 ± 0.195
1.508GlnGly: 1.508 ± 0.236
0.593GlnHis: 0.593 ± 0.119
2.532GlnIle: 2.532 ± 0.268
3.206GlnLys: 3.206 ± 0.484
3.663GlnLeu: 3.663 ± 0.372
1.185GlnMet: 1.185 ± 0.145
2.317GlnAsn: 2.317 ± 0.302
0.727GlnPro: 0.727 ± 0.122
1.751GlnGln: 1.751 ± 0.501
1.212GlnArg: 1.212 ± 0.166
2.236GlnSer: 2.236 ± 0.209
1.508GlnThr: 1.508 ± 0.238
2.317GlnVal: 2.317 ± 0.285
0.485GlnTrp: 0.485 ± 0.111
1.535GlnTyr: 1.535 ± 0.24
0.0GlnXaa: 0.0 ± 0.0
Arg
1.67ArgAla: 1.67 ± 0.184
0.296ArgCys: 0.296 ± 0.109
2.182ArgAsp: 2.182 ± 0.225
3.179ArgGlu: 3.179 ± 0.367
1.805ArgPhe: 1.805 ± 0.217
2.182ArgGly: 2.182 ± 0.292
0.862ArgHis: 0.862 ± 0.166
2.828ArgIle: 2.828 ± 0.257
3.529ArgLys: 3.529 ± 0.303
3.421ArgLeu: 3.421 ± 0.349
1.104ArgMet: 1.104 ± 0.165
2.748ArgAsn: 2.748 ± 0.322
0.97ArgPro: 0.97 ± 0.175
1.562ArgGln: 1.562 ± 0.191
1.697ArgArg: 1.697 ± 0.263
2.047ArgSer: 2.047 ± 0.218
1.832ArgThr: 1.832 ± 0.246
2.397ArgVal: 2.397 ± 0.277
0.539ArgTrp: 0.539 ± 0.132
1.832ArgTyr: 1.832 ± 0.238
0.0ArgXaa: 0.0 ± 0.0
Ser
3.152SerAla: 3.152 ± 0.531
0.646SerCys: 0.646 ± 0.162
4.418SerAsp: 4.418 ± 0.365
4.983SerGlu: 4.983 ± 0.338
3.232SerPhe: 3.232 ± 0.29
3.556SerGly: 3.556 ± 0.338
0.997SerHis: 0.997 ± 0.165
4.956SerIle: 4.956 ± 0.436
5.765SerLys: 5.765 ± 0.464
6.223SerLeu: 6.223 ± 0.492
1.455SerMet: 1.455 ± 0.195
4.121SerAsn: 4.121 ± 0.298
2.101SerPro: 2.101 ± 0.274
2.02SerGln: 2.02 ± 0.303
2.236SerArg: 2.236 ± 0.239
5.226SerSer: 5.226 ± 0.714
3.502SerThr: 3.502 ± 0.377
4.041SerVal: 4.041 ± 0.289
0.673SerTrp: 0.673 ± 0.108
2.29SerTyr: 2.29 ± 0.199
0.0SerXaa: 0.0 ± 0.0
Thr
3.044ThrAla: 3.044 ± 0.496
0.269ThrCys: 0.269 ± 0.075
2.99ThrAsp: 2.99 ± 0.298
4.364ThrGlu: 4.364 ± 0.312
1.993ThrPhe: 1.993 ± 0.232
3.475ThrGly: 3.475 ± 0.354
0.673ThrHis: 0.673 ± 0.152
3.852ThrIle: 3.852 ± 0.318
4.579ThrLys: 4.579 ± 0.351
4.175ThrLeu: 4.175 ± 0.312
1.077ThrMet: 1.077 ± 0.164
2.559ThrAsn: 2.559 ± 0.261
1.913ThrPro: 1.913 ± 0.225
2.074ThrGln: 2.074 ± 0.252
1.778ThrArg: 1.778 ± 0.173
3.502ThrSer: 3.502 ± 0.508
2.882ThrThr: 2.882 ± 0.42
4.337ThrVal: 4.337 ± 0.306
0.512ThrTrp: 0.512 ± 0.109
2.882ThrTyr: 2.882 ± 0.319
0.0ThrXaa: 0.0 ± 0.0
Val
3.475ValAla: 3.475 ± 0.275
0.7ValCys: 0.7 ± 0.144
3.906ValAsp: 3.906 ± 0.317
4.768ValGlu: 4.768 ± 0.351
2.559ValPhe: 2.559 ± 0.313
3.232ValGly: 3.232 ± 0.319
1.185ValHis: 1.185 ± 0.2
4.175ValIle: 4.175 ± 0.368
6.061ValLys: 6.061 ± 0.386
4.66ValLeu: 4.66 ± 0.387
1.266ValMet: 1.266 ± 0.191
3.825ValAsn: 3.825 ± 0.349
1.724ValPro: 1.724 ± 0.224
2.074ValGln: 2.074 ± 0.263
2.209ValArg: 2.209 ± 0.217
3.61ValSer: 3.61 ± 0.333
3.637ValThr: 3.637 ± 0.344
3.152ValVal: 3.152 ± 0.252
0.539ValTrp: 0.539 ± 0.13
2.855ValTyr: 2.855 ± 0.323
0.0ValXaa: 0.0 ± 0.0
Trp
0.35TrpAla: 0.35 ± 0.095
0.189TrpCys: 0.189 ± 0.074
0.835TrpAsp: 0.835 ± 0.14
0.943TrpGlu: 0.943 ± 0.161
0.404TrpPhe: 0.404 ± 0.09
0.404TrpGly: 0.404 ± 0.095
0.135TrpHis: 0.135 ± 0.055
0.835TrpIle: 0.835 ± 0.168
0.835TrpLys: 0.835 ± 0.163
0.835TrpLeu: 0.835 ± 0.144
0.135TrpMet: 0.135 ± 0.064
0.485TrpAsn: 0.485 ± 0.131
0.135TrpPro: 0.135 ± 0.05
0.296TrpGln: 0.296 ± 0.097
0.458TrpArg: 0.458 ± 0.108
0.754TrpSer: 0.754 ± 0.122
0.512TrpThr: 0.512 ± 0.111
0.7TrpVal: 0.7 ± 0.141
0.081TrpTrp: 0.081 ± 0.052
0.593TrpTyr: 0.593 ± 0.117
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.616TyrAla: 1.616 ± 0.188
0.512TyrCys: 0.512 ± 0.114
3.394TyrAsp: 3.394 ± 0.295
3.771TyrGlu: 3.771 ± 0.368
2.344TyrPhe: 2.344 ± 0.291
2.451TyrGly: 2.451 ± 0.302
0.673TyrHis: 0.673 ± 0.129
3.34TyrIle: 3.34 ± 0.299
4.606TyrLys: 4.606 ± 0.326
3.906TyrLeu: 3.906 ± 0.311
0.97TyrMet: 0.97 ± 0.149
2.344TyrAsn: 2.344 ± 0.232
1.158TyrPro: 1.158 ± 0.159
1.455TyrGln: 1.455 ± 0.155
2.02TyrArg: 2.02 ± 0.271
3.098TyrSer: 3.098 ± 0.299
2.613TyrThr: 2.613 ± 0.218
2.317TyrVal: 2.317 ± 0.28
0.377TyrTrp: 0.377 ± 0.092
2.101TyrTyr: 2.101 ± 0.268
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 188 proteins (37124 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski