Amino acid dipepetide frequency for Escherichia phage SUSP1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.831AlaAla: 5.831 ± 0.684
0.71AlaCys: 0.71 ± 0.169
4.149AlaAsp: 4.149 ± 0.437
4.224AlaGlu: 4.224 ± 0.439
2.729AlaPhe: 2.729 ± 0.369
5.457AlaGly: 5.457 ± 0.538
1.607AlaHis: 1.607 ± 0.298
5.121AlaIle: 5.121 ± 0.47
5.831AlaLys: 5.831 ± 0.473
6.093AlaLeu: 6.093 ± 0.535
2.803AlaMet: 2.803 ± 0.326
3.775AlaAsn: 3.775 ± 0.55
1.57AlaPro: 1.57 ± 0.256
2.729AlaGln: 2.729 ± 0.364
2.579AlaArg: 2.579 ± 0.368
4.485AlaSer: 4.485 ± 0.5
4.71AlaThr: 4.71 ± 0.507
4.747AlaVal: 4.747 ± 0.407
1.047AlaTrp: 1.047 ± 0.191
2.878AlaTyr: 2.878 ± 0.328
0.0AlaXaa: 0.0 ± 0.0
Cys
0.523CysAla: 0.523 ± 0.153
0.224CysCys: 0.224 ± 0.099
0.449CysAsp: 0.449 ± 0.115
0.897CysGlu: 0.897 ± 0.176
0.635CysPhe: 0.635 ± 0.164
1.271CysGly: 1.271 ± 0.199
0.374CysHis: 0.374 ± 0.11
0.523CysIle: 0.523 ± 0.148
1.532CysLys: 1.532 ± 0.261
0.748CysLeu: 0.748 ± 0.151
0.336CysMet: 0.336 ± 0.096
0.635CysAsn: 0.635 ± 0.171
0.71CysPro: 0.71 ± 0.157
0.486CysGln: 0.486 ± 0.136
0.748CysArg: 0.748 ± 0.182
1.084CysSer: 1.084 ± 0.224
0.635CysThr: 0.635 ± 0.142
1.047CysVal: 1.047 ± 0.176
0.075CysTrp: 0.075 ± 0.055
0.523CysTyr: 0.523 ± 0.112
0.0CysXaa: 0.0 ± 0.0
Asp
5.195AspAla: 5.195 ± 0.496
0.748AspCys: 0.748 ± 0.188
3.813AspAsp: 3.813 ± 0.571
4.71AspGlu: 4.71 ± 0.487
3.289AspPhe: 3.289 ± 0.329
5.046AspGly: 5.046 ± 0.368
0.785AspHis: 0.785 ± 0.171
4.298AspIle: 4.298 ± 0.363
4.523AspLys: 4.523 ± 0.392
4.971AspLeu: 4.971 ± 0.462
1.607AspMet: 1.607 ± 0.227
3.551AspAsn: 3.551 ± 0.32
1.607AspPro: 1.607 ± 0.311
0.748AspGln: 0.748 ± 0.196
1.869AspArg: 1.869 ± 0.235
4.074AspSer: 4.074 ± 0.349
3.588AspThr: 3.588 ± 0.353
4.224AspVal: 4.224 ± 0.391
1.346AspTrp: 1.346 ± 0.23
2.28AspTyr: 2.28 ± 0.299
0.0AspXaa: 0.0 ± 0.0
Glu
5.943GluAla: 5.943 ± 0.613
0.785GluCys: 0.785 ± 0.181
4.448GluAsp: 4.448 ± 0.393
4.672GluGlu: 4.672 ± 0.46
3.14GluPhe: 3.14 ± 0.373
3.887GluGly: 3.887 ± 0.366
1.458GluHis: 1.458 ± 0.215
4.597GluIle: 4.597 ± 0.482
4.523GluLys: 4.523 ± 0.518
5.644GluLeu: 5.644 ± 0.362
2.878GluMet: 2.878 ± 0.326
3.364GluAsn: 3.364 ± 0.306
1.906GluPro: 1.906 ± 0.257
2.467GluGln: 2.467 ± 0.347
3.177GluArg: 3.177 ± 0.358
3.551GluSer: 3.551 ± 0.429
3.065GluThr: 3.065 ± 0.293
4.635GluVal: 4.635 ± 0.354
0.822GluTrp: 0.822 ± 0.168
2.654GluTyr: 2.654 ± 0.306
0.0GluXaa: 0.0 ± 0.0
Phe
2.691PheAla: 2.691 ± 0.393
0.486PheCys: 0.486 ± 0.141
3.14PheAsp: 3.14 ± 0.316
3.513PheGlu: 3.513 ± 0.385
1.57PhePhe: 1.57 ± 0.23
3.401PheGly: 3.401 ± 0.418
0.86PheHis: 0.86 ± 0.201
2.691PheIle: 2.691 ± 0.297
3.289PheLys: 3.289 ± 0.363
2.729PheLeu: 2.729 ± 0.368
1.233PheMet: 1.233 ± 0.206
2.243PheAsn: 2.243 ± 0.255
1.233PhePro: 1.233 ± 0.203
1.383PheGln: 1.383 ± 0.214
1.495PheArg: 1.495 ± 0.2
3.028PheSer: 3.028 ± 0.312
2.953PheThr: 2.953 ± 0.359
2.729PheVal: 2.729 ± 0.309
0.411PheTrp: 0.411 ± 0.13
1.944PheTyr: 1.944 ± 0.273
0.0PheXaa: 0.0 ± 0.0
Gly
5.121GlyAla: 5.121 ± 0.547
1.009GlyCys: 1.009 ± 0.183
3.663GlyAsp: 3.663 ± 0.368
4.822GlyGlu: 4.822 ± 0.524
3.401GlyPhe: 3.401 ± 0.424
4.261GlyGly: 4.261 ± 0.528
1.233GlyHis: 1.233 ± 0.239
3.738GlyIle: 3.738 ± 0.444
5.756GlyLys: 5.756 ± 0.493
4.859GlyLeu: 4.859 ± 0.419
1.607GlyMet: 1.607 ± 0.263
3.401GlyAsn: 3.401 ± 0.443
0.075GlyPro: 0.075 ± 0.054
2.056GlyGln: 2.056 ± 0.307
2.803GlyArg: 2.803 ± 0.333
4.336GlySer: 4.336 ± 0.525
4.149GlyThr: 4.149 ± 0.472
4.896GlyVal: 4.896 ± 0.393
0.785GlyTrp: 0.785 ± 0.158
3.364GlyTyr: 3.364 ± 0.387
0.0GlyXaa: 0.0 ± 0.0
His
0.897HisAla: 0.897 ± 0.137
0.449HisCys: 0.449 ± 0.14
1.121HisAsp: 1.121 ± 0.241
0.897HisGlu: 0.897 ± 0.17
0.822HisPhe: 0.822 ± 0.169
0.897HisGly: 0.897 ± 0.186
0.523HisHis: 0.523 ± 0.174
1.495HisIle: 1.495 ± 0.217
1.458HisLys: 1.458 ± 0.283
1.532HisLeu: 1.532 ± 0.254
0.673HisMet: 0.673 ± 0.167
0.785HisAsn: 0.785 ± 0.192
0.822HisPro: 0.822 ± 0.159
0.635HisGln: 0.635 ± 0.139
1.047HisArg: 1.047 ± 0.158
1.57HisSer: 1.57 ± 0.322
1.383HisThr: 1.383 ± 0.342
1.532HisVal: 1.532 ± 0.242
0.224HisTrp: 0.224 ± 0.13
1.196HisTyr: 1.196 ± 0.209
0.0HisXaa: 0.0 ± 0.0
Ile
4.448IleAla: 4.448 ± 0.444
0.934IleCys: 0.934 ± 0.165
4.112IleAsp: 4.112 ± 0.385
4.186IleGlu: 4.186 ± 0.473
2.729IlePhe: 2.729 ± 0.28
3.513IleGly: 3.513 ± 0.406
1.271IleHis: 1.271 ± 0.289
2.878IleIle: 2.878 ± 0.373
4.672IleLys: 4.672 ± 0.515
3.775IleLeu: 3.775 ± 0.386
1.607IleMet: 1.607 ± 0.268
3.252IleAsn: 3.252 ± 0.327
1.981IlePro: 1.981 ± 0.318
1.981IleGln: 1.981 ± 0.29
2.654IleArg: 2.654 ± 0.24
3.887IleSer: 3.887 ± 0.308
3.738IleThr: 3.738 ± 0.404
4.112IleVal: 4.112 ± 0.365
0.561IleTrp: 0.561 ± 0.118
2.915IleTyr: 2.915 ± 0.315
0.0IleXaa: 0.0 ± 0.0
Lys
6.167LysAla: 6.167 ± 0.564
0.934LysCys: 0.934 ± 0.206
5.009LysAsp: 5.009 ± 0.436
5.906LysGlu: 5.906 ± 0.56
3.028LysPhe: 3.028 ± 0.347
5.195LysGly: 5.195 ± 0.544
1.495LysHis: 1.495 ± 0.277
4.224LysIle: 4.224 ± 0.381
6.242LysLys: 6.242 ± 0.494
5.943LysLeu: 5.943 ± 0.394
2.467LysMet: 2.467 ± 0.253
3.252LysAsn: 3.252 ± 0.395
2.803LysPro: 2.803 ± 0.357
2.915LysGln: 2.915 ± 0.365
3.626LysArg: 3.626 ± 0.359
4.56LysSer: 4.56 ± 0.384
4.934LysThr: 4.934 ± 0.408
5.943LysVal: 5.943 ± 0.55
0.673LysTrp: 0.673 ± 0.162
3.102LysTyr: 3.102 ± 0.349
0.0LysXaa: 0.0 ± 0.0
Leu
5.98LeuAla: 5.98 ± 0.511
1.233LeuCys: 1.233 ± 0.201
5.681LeuAsp: 5.681 ± 0.504
5.681LeuGlu: 5.681 ± 0.493
2.99LeuPhe: 2.99 ± 0.317
4.112LeuGly: 4.112 ± 0.323
1.458LeuHis: 1.458 ± 0.251
3.588LeuIle: 3.588 ± 0.36
6.728LeuLys: 6.728 ± 0.585
6.093LeuLeu: 6.093 ± 0.507
2.243LeuMet: 2.243 ± 0.275
4.373LeuAsn: 4.373 ± 0.427
2.915LeuPro: 2.915 ± 0.364
3.214LeuGln: 3.214 ± 0.365
3.364LeuArg: 3.364 ± 0.379
4.971LeuSer: 4.971 ± 0.335
5.382LeuThr: 5.382 ± 0.429
3.775LeuVal: 3.775 ± 0.371
0.822LeuTrp: 0.822 ± 0.205
3.028LeuTyr: 3.028 ± 0.326
0.0LeuXaa: 0.0 ± 0.0
Met
2.205MetAla: 2.205 ± 0.296
0.262MetCys: 0.262 ± 0.104
1.271MetAsp: 1.271 ± 0.203
1.42MetGlu: 1.42 ± 0.202
1.308MetPhe: 1.308 ± 0.231
1.719MetGly: 1.719 ± 0.241
0.374MetHis: 0.374 ± 0.118
2.056MetIle: 2.056 ± 0.273
3.588MetLys: 3.588 ± 0.359
2.579MetLeu: 2.579 ± 0.287
0.71MetMet: 0.71 ± 0.173
1.757MetAsn: 1.757 ± 0.288
0.86MetPro: 0.86 ± 0.16
1.607MetGln: 1.607 ± 0.28
1.495MetArg: 1.495 ± 0.24
1.869MetSer: 1.869 ± 0.296
2.392MetThr: 2.392 ± 0.295
1.346MetVal: 1.346 ± 0.231
0.224MetTrp: 0.224 ± 0.077
0.897MetTyr: 0.897 ± 0.177
0.0MetXaa: 0.0 ± 0.0
Asn
3.775AsnAla: 3.775 ± 0.348
0.785AsnCys: 0.785 ± 0.179
2.542AsnAsp: 2.542 ± 0.274
2.654AsnGlu: 2.654 ± 0.328
2.43AsnPhe: 2.43 ± 0.252
3.775AsnGly: 3.775 ± 0.346
1.233AsnHis: 1.233 ± 0.248
3.177AsnIle: 3.177 ± 0.311
4.037AsnLys: 4.037 ± 0.413
4.485AsnLeu: 4.485 ± 0.396
1.196AsnMet: 1.196 ± 0.202
2.99AsnAsn: 2.99 ± 0.382
1.944AsnPro: 1.944 ± 0.247
1.869AsnGln: 1.869 ± 0.273
2.168AsnArg: 2.168 ± 0.262
3.14AsnSer: 3.14 ± 0.321
3.065AsnThr: 3.065 ± 0.548
3.925AsnVal: 3.925 ± 0.259
0.673AsnTrp: 0.673 ± 0.123
1.944AsnTyr: 1.944 ± 0.262
0.0AsnXaa: 0.0 ± 0.0
Pro
1.719ProAla: 1.719 ± 0.248
0.336ProCys: 0.336 ± 0.103
2.243ProAsp: 2.243 ± 0.287
2.616ProGlu: 2.616 ± 0.34
1.719ProPhe: 1.719 ± 0.264
0.336ProGly: 0.336 ± 0.122
0.71ProHis: 0.71 ± 0.194
1.233ProIle: 1.233 ± 0.176
2.43ProLys: 2.43 ± 0.277
2.205ProLeu: 2.205 ± 0.304
1.084ProMet: 1.084 ± 0.21
1.346ProAsn: 1.346 ± 0.218
1.009ProPro: 1.009 ± 0.225
1.196ProGln: 1.196 ± 0.213
1.121ProArg: 1.121 ± 0.174
2.093ProSer: 2.093 ± 0.373
2.355ProThr: 2.355 ± 0.283
2.243ProVal: 2.243 ± 0.325
0.262ProTrp: 0.262 ± 0.094
1.42ProTyr: 1.42 ± 0.233
0.0ProXaa: 0.0 ± 0.0
Gln
2.28GlnAla: 2.28 ± 0.318
0.449GlnCys: 0.449 ± 0.135
1.495GlnAsp: 1.495 ± 0.228
2.841GlnGlu: 2.841 ± 0.299
1.458GlnPhe: 1.458 ± 0.226
2.168GlnGly: 2.168 ± 0.359
0.598GlnHis: 0.598 ± 0.13
2.766GlnIle: 2.766 ± 0.298
2.243GlnLys: 2.243 ± 0.251
3.14GlnLeu: 3.14 ± 0.372
1.009GlnMet: 1.009 ± 0.226
2.056GlnAsn: 2.056 ± 0.314
1.121GlnPro: 1.121 ± 0.184
1.495GlnGln: 1.495 ± 0.237
1.757GlnArg: 1.757 ± 0.229
1.944GlnSer: 1.944 ± 0.272
2.131GlnThr: 2.131 ± 0.319
2.99GlnVal: 2.99 ± 0.359
0.523GlnTrp: 0.523 ± 0.139
1.906GlnTyr: 1.906 ± 0.255
0.0GlnXaa: 0.0 ± 0.0
Arg
2.729ArgAla: 2.729 ± 0.331
0.523ArgCys: 0.523 ± 0.139
2.915ArgAsp: 2.915 ± 0.327
2.355ArgGlu: 2.355 ± 0.338
1.906ArgPhe: 1.906 ± 0.335
2.766ArgGly: 2.766 ± 0.296
0.897ArgHis: 0.897 ± 0.217
2.504ArgIle: 2.504 ± 0.25
3.102ArgLys: 3.102 ± 0.362
3.999ArgLeu: 3.999 ± 0.367
1.532ArgMet: 1.532 ± 0.281
2.205ArgAsn: 2.205 ± 0.218
1.084ArgPro: 1.084 ± 0.219
1.981ArgGln: 1.981 ± 0.265
2.093ArgArg: 2.093 ± 0.301
2.355ArgSer: 2.355 ± 0.267
2.43ArgThr: 2.43 ± 0.273
2.766ArgVal: 2.766 ± 0.266
0.598ArgTrp: 0.598 ± 0.152
2.243ArgTyr: 2.243 ± 0.318
0.0ArgXaa: 0.0 ± 0.0
Ser
4.411SerAla: 4.411 ± 0.497
0.822SerCys: 0.822 ± 0.193
4.635SerAsp: 4.635 ± 0.401
4.373SerGlu: 4.373 ± 0.364
2.691SerPhe: 2.691 ± 0.331
4.224SerGly: 4.224 ± 0.454
1.532SerHis: 1.532 ± 0.237
3.102SerIle: 3.102 ± 0.36
3.999SerLys: 3.999 ± 0.294
5.233SerLeu: 5.233 ± 0.393
1.906SerMet: 1.906 ± 0.245
3.14SerAsn: 3.14 ± 0.395
1.794SerPro: 1.794 ± 0.298
2.915SerGln: 2.915 ± 0.333
2.691SerArg: 2.691 ± 0.254
3.85SerSer: 3.85 ± 0.42
2.99SerThr: 2.99 ± 0.316
4.448SerVal: 4.448 ± 0.434
0.897SerTrp: 0.897 ± 0.176
2.691SerTyr: 2.691 ± 0.255
0.0SerXaa: 0.0 ± 0.0
Thr
4.822ThrAla: 4.822 ± 0.48
0.785ThrCys: 0.785 ± 0.169
3.364ThrAsp: 3.364 ± 0.348
3.439ThrGlu: 3.439 ± 0.439
2.841ThrPhe: 2.841 ± 0.285
5.719ThrGly: 5.719 ± 0.597
1.346ThrHis: 1.346 ± 0.282
3.999ThrIle: 3.999 ± 0.363
4.896ThrLys: 4.896 ± 0.472
4.448ThrLeu: 4.448 ± 0.399
1.047ThrMet: 1.047 ± 0.161
2.43ThrAsn: 2.43 ± 0.334
2.355ThrPro: 2.355 ± 0.277
2.542ThrGln: 2.542 ± 0.257
2.542ThrArg: 2.542 ± 0.284
3.7ThrSer: 3.7 ± 0.39
3.962ThrThr: 3.962 ± 0.563
4.971ThrVal: 4.971 ± 0.434
0.748ThrTrp: 0.748 ± 0.154
3.252ThrTyr: 3.252 ± 0.356
0.0ThrXaa: 0.0 ± 0.0
Val
5.382ValAla: 5.382 ± 0.493
0.972ValCys: 0.972 ± 0.202
4.672ValAsp: 4.672 ± 0.386
4.373ValGlu: 4.373 ± 0.363
2.018ValPhe: 2.018 ± 0.247
4.149ValGly: 4.149 ± 0.478
1.121ValHis: 1.121 ± 0.193
4.635ValIle: 4.635 ± 0.441
5.27ValLys: 5.27 ± 0.421
4.672ValLeu: 4.672 ± 0.412
2.392ValMet: 2.392 ± 0.279
3.775ValAsn: 3.775 ± 0.334
2.168ValPro: 2.168 ± 0.342
1.944ValGln: 1.944 ± 0.235
3.551ValArg: 3.551 ± 0.395
4.56ValSer: 4.56 ± 0.432
4.971ValThr: 4.971 ± 0.466
4.298ValVal: 4.298 ± 0.445
0.486ValTrp: 0.486 ± 0.141
2.766ValTyr: 2.766 ± 0.4
0.0ValXaa: 0.0 ± 0.0
Trp
0.449TrpAla: 0.449 ± 0.134
0.15TrpCys: 0.15 ± 0.077
0.86TrpAsp: 0.86 ± 0.192
0.897TrpGlu: 0.897 ± 0.17
0.561TrpPhe: 0.561 ± 0.143
0.598TrpGly: 0.598 ± 0.153
0.075TrpHis: 0.075 ± 0.048
0.561TrpIle: 0.561 ± 0.124
0.86TrpLys: 0.86 ± 0.145
1.159TrpLeu: 1.159 ± 0.221
0.411TrpMet: 0.411 ± 0.127
1.009TrpAsn: 1.009 ± 0.228
0.187TrpPro: 0.187 ± 0.072
0.748TrpGln: 0.748 ± 0.152
0.561TrpArg: 0.561 ± 0.147
0.635TrpSer: 0.635 ± 0.144
0.598TrpThr: 0.598 ± 0.151
0.598TrpVal: 0.598 ± 0.119
0.15TrpTrp: 0.15 ± 0.071
0.673TrpTyr: 0.673 ± 0.131
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.504TyrAla: 2.504 ± 0.366
0.86TyrCys: 0.86 ± 0.165
2.691TyrAsp: 2.691 ± 0.361
3.14TyrGlu: 3.14 ± 0.301
1.757TyrPhe: 1.757 ± 0.219
2.841TyrGly: 2.841 ± 0.34
1.009TyrHis: 1.009 ± 0.181
2.018TyrIle: 2.018 ± 0.258
3.439TyrLys: 3.439 ± 0.363
3.327TyrLeu: 3.327 ± 0.365
1.159TyrMet: 1.159 ± 0.215
2.467TyrAsn: 2.467 ± 0.291
1.532TyrPro: 1.532 ± 0.22
1.495TyrGln: 1.495 ± 0.236
1.645TyrArg: 1.645 ± 0.25
2.654TyrSer: 2.654 ± 0.3
3.775TyrThr: 3.775 ± 0.361
2.953TyrVal: 2.953 ± 0.29
0.411TyrTrp: 0.411 ± 0.115
2.093TyrTyr: 2.093 ± 0.272
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 138 proteins (26755 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski