Amino acid dipeptide frequency for Escherichia phage EcSzw_1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred with equal probability in proteins, each of the 400 standard dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
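The calculation described above can be sketched in Python as follows. This is a minimal illustration, not the code used to build this page: the function names `dipeptide_per_mille` and `classify` are hypothetical, and it assumes that frequencies are normalized by the total number of overlapping residue pairs, that pairs never span two proteins, and that any non-standard letter is mapped to Xaa.

```python
from collections import Counter
from itertools import product

# One-letter to three-letter codes, in the column order used by the table.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe", "G": "Gly",
    "H": "His", "I": "Ile", "K": "Lys", "L": "Leu", "M": "Met", "N": "Asn",
    "P": "Pro", "Q": "Gln", "R": "Arg", "S": "Ser", "T": "Thr", "V": "Val",
    "W": "Trp", "Y": "Tyr",
}
NAMES = list(ONE_TO_THREE.values()) + ["Xaa"]

# Expectation if all 400 standard dipeptides were equally likely: 2.5 per mille.
UNIFORM_EXPECTATION = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 combinations."""
    counts = Counter()
    for seq in proteins:
        # Assumption: every letter outside the 20 standard ones becomes Xaa.
        codes = [ONE_TO_THREE.get(aa, "Xaa") for aa in seq.upper()]
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(a + b for a, b in zip(codes, codes[1:]))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(NAMES, repeat=2)
    }


def classify(freq, expected=UNIFORM_EXPECTATION):
    """Label a per-mille frequency relative to the uniform expectation."""
    if freq > expected:
        return "over"
    if freq < expected:
        return "under"
    return "as expected"


if __name__ == "__main__":
    freqs = dipeptide_per_mille(["MKVLAA", "MAXK"])  # toy sequences, not real data
    print(len(freqs), classify(6.098), classify(0.782))
```

With the values from the table, `classify(6.098)` (AlaAla) gives "over" and `classify(0.782)` (AlaCys) gives "under", which corresponds to the red and blue marking described above.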

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
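As a small worked example of this unit conversion, the sketch below turns a per-mille value into a fraction and into a percentage. The helper names are hypothetical and serve only as illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to percent (1 percent = 10 per mille)."""
    return value / 10.0


if __name__ == "__main__":
    # AlaAla is listed as 6.098 per mille in the table.
    print(per_mille_to_fraction(6.098))  # about 0.006098
    print(per_mille_to_percent(6.098))   # about 0.6098
```

On this scale, the uniform expectation of 0.25% for a single dipeptide corresponds to 2.5‰.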

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.098 ± 0.57
AlaCys: 0.782 ± 0.165
AlaAsp: 4.417 ± 0.443
AlaGlu: 4.417 ± 0.406
AlaPhe: 2.932 ± 0.366
AlaGly: 5.355 ± 0.5
AlaHis: 1.446 ± 0.258
AlaIle: 5.042 ± 0.434
AlaLys: 5.355 ± 0.507
AlaLeu: 6.098 ± 0.488
AlaMet: 2.736 ± 0.325
AlaAsn: 4.339 ± 0.455
AlaPro: 1.563 ± 0.278
AlaGln: 2.697 ± 0.398
AlaArg: 2.462 ± 0.385
AlaSer: 4.69 ± 0.501
AlaThr: 4.378 ± 0.437
AlaVal: 5.277 ± 0.423
AlaTrp: 0.821 ± 0.17
AlaTyr: 2.892 ± 0.295
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.586 ± 0.187
CysCys: 0.235 ± 0.113
CysAsp: 0.664 ± 0.14
CysGlu: 0.899 ± 0.196
CysPhe: 0.625 ± 0.168
CysGly: 1.094 ± 0.194
CysHis: 0.469 ± 0.146
CysIle: 0.508 ± 0.124
CysLys: 1.212 ± 0.239
CysLeu: 0.704 ± 0.148
CysMet: 0.43 ± 0.119
CysAsn: 0.743 ± 0.173
CysPro: 0.704 ± 0.145
CysGln: 0.391 ± 0.123
CysArg: 0.625 ± 0.172
CysSer: 0.977 ± 0.222
CysThr: 0.782 ± 0.148
CysVal: 0.782 ± 0.167
CysTrp: 0.039 ± 0.038
CysTyr: 0.586 ± 0.161
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.042 ± 0.431
AspCys: 0.586 ± 0.16
AspAsp: 3.87 ± 0.63
AspGlu: 4.417 ± 0.575
AspPhe: 3.44 ± 0.307
AspGly: 5.003 ± 0.408
AspHis: 0.704 ± 0.174
AspIle: 4.847 ± 0.401
AspLys: 4.573 ± 0.439
AspLeu: 4.73 ± 0.437
AspMet: 1.798 ± 0.219
AspAsn: 3.127 ± 0.372
AspPro: 1.524 ± 0.274
AspGln: 0.782 ± 0.201
AspArg: 2.111 ± 0.246
AspSer: 4.417 ± 0.411
AspThr: 3.791 ± 0.425
AspVal: 4.378 ± 0.457
AspTrp: 1.485 ± 0.276
AspTyr: 2.345 ± 0.313
AspXaa: 0.039 ± 0.036
Glu
GluAla: 5.316 ± 0.606
GluCys: 0.743 ± 0.179
GluAsp: 4.104 ± 0.41
GluGlu: 5.003 ± 0.526
GluPhe: 3.01 ± 0.316
GluGly: 3.987 ± 0.386
GluHis: 1.524 ± 0.222
GluIle: 4.417 ± 0.466
GluLys: 4.69 ± 0.562
GluLeu: 6.019 ± 0.469
GluMet: 2.345 ± 0.277
GluAsn: 3.44 ± 0.446
GluPro: 1.72 ± 0.308
GluGln: 2.111 ± 0.29
GluArg: 2.892 ± 0.346
GluSer: 3.909 ± 0.386
GluThr: 3.205 ± 0.369
GluVal: 4.69 ± 0.37
GluTrp: 0.821 ± 0.19
GluTyr: 3.01 ± 0.347
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.619 ± 0.394
PheCys: 0.547 ± 0.146
PheAsp: 3.088 ± 0.293
PheGlu: 3.557 ± 0.359
PhePhe: 1.407 ± 0.227
PheGly: 3.596 ± 0.403
PheHis: 0.899 ± 0.199
PheIle: 2.736 ± 0.323
PheLys: 3.401 ± 0.38
PheLeu: 3.127 ± 0.38
PheMet: 1.094 ± 0.195
PheAsn: 1.915 ± 0.296
PhePro: 1.212 ± 0.223
PheGln: 1.524 ± 0.222
PheArg: 1.524 ± 0.244
PheSer: 2.932 ± 0.324
PheThr: 2.58 ± 0.341
PheVal: 2.658 ± 0.32
PheTrp: 0.43 ± 0.122
PheTyr: 1.993 ± 0.252
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.589 ± 0.525
GlyCys: 1.055 ± 0.209
GlyAsp: 3.361 ± 0.32
GlyGlu: 4.651 ± 0.519
GlyPhe: 3.244 ± 0.384
GlyGly: 4.221 ± 0.496
GlyHis: 1.485 ± 0.23
GlyIle: 3.987 ± 0.463
GlyLys: 6.137 ± 0.566
GlyLeu: 4.808 ± 0.541
GlyMet: 1.524 ± 0.283
GlyAsn: 3.244 ± 0.413
GlyPro: 0.078 ± 0.055
GlyGln: 2.423 ± 0.355
GlyArg: 2.736 ± 0.314
GlySer: 3.87 ± 0.412
GlyThr: 4.104 ± 0.541
GlyVal: 5.199 ± 0.406
GlyTrp: 0.86 ± 0.19
GlyTyr: 3.283 ± 0.351
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.899 ± 0.163
HisCys: 0.469 ± 0.156
HisAsp: 0.977 ± 0.212
HisGlu: 1.173 ± 0.203
HisPhe: 0.704 ± 0.167
HisGly: 1.016 ± 0.172
HisHis: 0.704 ± 0.225
HisIle: 1.446 ± 0.256
HisLys: 1.173 ± 0.197
HisLeu: 1.876 ± 0.328
HisMet: 0.664 ± 0.159
HisAsn: 0.821 ± 0.19
HisPro: 0.704 ± 0.174
HisGln: 0.664 ± 0.15
HisArg: 1.016 ± 0.209
HisSer: 1.798 ± 0.375
HisThr: 1.407 ± 0.342
HisVal: 1.524 ± 0.254
HisTrp: 0.274 ± 0.105
HisTyr: 1.212 ± 0.23
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.909 ± 0.335
IleCys: 0.86 ± 0.209
IleAsp: 4.417 ± 0.43
IleGlu: 3.87 ± 0.433
IlePhe: 2.932 ± 0.331
IleGly: 3.401 ± 0.396
IleHis: 1.212 ± 0.237
IleIle: 2.853 ± 0.369
IleLys: 4.73 ± 0.5
IleLeu: 3.948 ± 0.406
IleMet: 1.524 ± 0.3
IleAsn: 2.736 ± 0.351
IlePro: 2.072 ± 0.326
IleGln: 1.798 ± 0.26
IleArg: 2.502 ± 0.307
IleSer: 4.143 ± 0.448
IleThr: 3.635 ± 0.343
IleVal: 4.3 ± 0.426
IleTrp: 0.821 ± 0.191
IleTyr: 3.01 ± 0.34
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.176 ± 0.547
LysCys: 1.055 ± 0.238
LysAsp: 5.081 ± 0.539
LysGlu: 6.098 ± 0.545
LysPhe: 2.697 ± 0.333
LysGly: 5.003 ± 0.525
LysHis: 1.837 ± 0.312
LysIle: 4.3 ± 0.464
LysLys: 5.472 ± 0.524
LysLeu: 5.785 ± 0.381
LysMet: 2.423 ± 0.363
LysAsn: 3.244 ± 0.327
LysPro: 2.658 ± 0.31
LysGln: 2.736 ± 0.315
LysArg: 3.401 ± 0.33
LysSer: 4.3 ± 0.44
LysThr: 4.769 ± 0.492
LysVal: 6.957 ± 0.55
LysTrp: 0.743 ± 0.183
LysTyr: 2.541 ± 0.419
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.528 ± 0.506
LeuCys: 1.016 ± 0.204
LeuAsp: 5.746 ± 0.494
LeuGlu: 5.746 ± 0.483
LeuPhe: 3.244 ± 0.352
LeuGly: 4.495 ± 0.409
LeuHis: 1.642 ± 0.288
LeuIle: 3.557 ± 0.389
LeuLys: 7.036 ± 0.567
LeuLeu: 5.902 ± 0.525
LeuMet: 2.306 ± 0.313
LeuAsn: 3.909 ± 0.45
LeuPro: 2.853 ± 0.35
LeuGln: 3.283 ± 0.406
LeuArg: 3.518 ± 0.414
LeuSer: 4.808 ± 0.408
LeuThr: 5.511 ± 0.5
LeuVal: 4.182 ± 0.363
LeuTrp: 0.782 ± 0.19
LeuTyr: 3.088 ± 0.357
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.15 ± 0.308
MetCys: 0.235 ± 0.101
MetAsp: 1.173 ± 0.2
MetGlu: 1.642 ± 0.233
MetPhe: 1.212 ± 0.254
MetGly: 1.798 ± 0.235
MetHis: 0.313 ± 0.111
MetIle: 2.033 ± 0.285
MetLys: 3.674 ± 0.411
MetLeu: 2.462 ± 0.333
MetMet: 0.704 ± 0.183
MetAsn: 1.642 ± 0.277
MetPro: 0.821 ± 0.183
MetGln: 1.368 ± 0.283
MetArg: 1.368 ± 0.201
MetSer: 2.306 ± 0.308
MetThr: 2.111 ± 0.258
MetVal: 1.094 ± 0.165
MetTrp: 0.235 ± 0.096
MetTyr: 0.86 ± 0.213
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.987 ± 0.356
AsnCys: 0.743 ± 0.171
AsnAsp: 2.345 ± 0.287
AsnGlu: 2.423 ± 0.314
AsnPhe: 2.345 ± 0.297
AsnGly: 3.948 ± 0.397
AsnHis: 1.251 ± 0.234
AsnIle: 2.814 ± 0.269
AsnLys: 3.752 ± 0.443
AsnLeu: 4.612 ± 0.407
AsnMet: 1.212 ± 0.235
AsnAsn: 2.892 ± 0.331
AsnPro: 2.072 ± 0.28
AsnGln: 2.111 ± 0.297
AsnArg: 1.837 ± 0.285
AsnSer: 2.932 ± 0.292
AsnThr: 3.283 ± 0.543
AsnVal: 3.557 ± 0.351
AsnTrp: 0.743 ± 0.148
AsnTyr: 2.033 ± 0.26
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.563 ± 0.256
ProCys: 0.274 ± 0.087
ProAsp: 2.033 ± 0.325
ProGlu: 3.01 ± 0.309
ProPhe: 1.563 ± 0.261
ProGly: 0.352 ± 0.13
ProHis: 0.664 ± 0.193
ProIle: 1.251 ± 0.2
ProLys: 2.58 ± 0.309
ProLeu: 2.15 ± 0.315
ProMet: 1.212 ± 0.262
ProAsn: 1.642 ± 0.28
ProPro: 0.977 ± 0.222
ProGln: 1.134 ± 0.235
ProArg: 1.016 ± 0.211
ProSer: 2.072 ± 0.337
ProThr: 2.384 ± 0.327
ProVal: 1.954 ± 0.296
ProTrp: 0.274 ± 0.08
ProTyr: 1.407 ± 0.229
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.502 ± 0.335
GlnCys: 0.508 ± 0.152
GlnAsp: 1.446 ± 0.221
GlnGlu: 2.502 ± 0.325
GlnPhe: 1.563 ± 0.231
GlnGly: 2.15 ± 0.333
GlnHis: 0.664 ± 0.163
GlnIle: 2.502 ± 0.323
GlnLys: 2.462 ± 0.302
GlnLeu: 2.932 ± 0.392
GlnMet: 1.094 ± 0.248
GlnAsn: 2.306 ± 0.303
GlnPro: 1.173 ± 0.213
GlnGln: 1.368 ± 0.232
GlnArg: 1.642 ± 0.237
GlnSer: 1.72 ± 0.276
GlnThr: 2.345 ± 0.389
GlnVal: 2.619 ± 0.268
GlnTrp: 0.469 ± 0.134
GlnTyr: 1.837 ± 0.272
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.932 ± 0.309
ArgCys: 0.508 ± 0.145
ArgAsp: 3.244 ± 0.332
ArgGlu: 2.462 ± 0.379
ArgPhe: 1.72 ± 0.326
ArgGly: 2.971 ± 0.345
ArgHis: 0.977 ± 0.2
ArgIle: 2.58 ± 0.294
ArgLys: 2.971 ± 0.284
ArgLeu: 3.674 ± 0.417
ArgMet: 1.368 ± 0.282
ArgAsn: 2.072 ± 0.283
ArgPro: 0.938 ± 0.193
ArgGln: 1.837 ± 0.262
ArgArg: 2.033 ± 0.291
ArgSer: 1.993 ± 0.275
ArgThr: 2.541 ± 0.327
ArgVal: 2.502 ± 0.248
ArgTrp: 0.547 ± 0.173
ArgTyr: 1.876 ± 0.296
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.221 ± 0.499
SerCys: 0.664 ± 0.151
SerAsp: 4.573 ± 0.383
SerGlu: 4.104 ± 0.423
SerPhe: 2.971 ± 0.333
SerGly: 4.3 ± 0.445
SerHis: 1.329 ± 0.306
SerIle: 3.049 ± 0.33
SerLys: 4.143 ± 0.38
SerLeu: 5.589 ± 0.481
SerMet: 1.72 ± 0.256
SerAsn: 2.736 ± 0.406
SerPro: 1.837 ± 0.308
SerGln: 2.775 ± 0.365
SerArg: 2.658 ± 0.262
SerSer: 4.221 ± 0.458
SerThr: 3.205 ± 0.41
SerVal: 4.925 ± 0.396
SerTrp: 0.899 ± 0.197
SerTyr: 2.697 ± 0.318
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.925 ± 0.463
ThrCys: 0.547 ± 0.137
ThrAsp: 3.987 ± 0.515
ThrGlu: 3.244 ± 0.369
ThrPhe: 2.892 ± 0.351
ThrGly: 5.433 ± 0.519
ThrHis: 1.407 ± 0.312
ThrIle: 3.674 ± 0.336
ThrLys: 4.417 ± 0.459
ThrLeu: 4.26 ± 0.508
ThrMet: 1.134 ± 0.237
ThrAsn: 2.502 ± 0.42
ThrPro: 2.58 ± 0.272
ThrGln: 2.58 ± 0.307
ThrArg: 2.462 ± 0.339
ThrSer: 3.831 ± 0.497
ThrThr: 4.065 ± 0.605
ThrVal: 5.12 ± 0.438
ThrTrp: 0.86 ± 0.148
ThrTyr: 3.322 ± 0.453
ThrXaa: 0.039 ± 0.041
Val
ValAla: 5.785 ± 0.486
ValCys: 1.055 ± 0.215
ValAsp: 4.886 ± 0.387
ValGlu: 4.143 ± 0.351
ValPhe: 2.15 ± 0.3
ValGly: 4.3 ± 0.507
ValHis: 1.055 ± 0.222
ValIle: 4.339 ± 0.447
ValLys: 5.472 ± 0.443
ValLeu: 5.394 ± 0.491
ValMet: 2.15 ± 0.326
ValAsn: 4.026 ± 0.327
ValPro: 2.228 ± 0.298
ValGln: 2.15 ± 0.293
ValArg: 3.479 ± 0.429
ValSer: 4.612 ± 0.44
ValThr: 4.808 ± 0.439
ValVal: 4.769 ± 0.523
ValTrp: 0.547 ± 0.171
ValTyr: 2.697 ± 0.355
ValXaa: 0.039 ± 0.044
Trp
TrpAla: 0.391 ± 0.144
TrpCys: 0.274 ± 0.096
TrpAsp: 0.899 ± 0.189
TrpGlu: 0.743 ± 0.147
TrpPhe: 0.508 ± 0.142
TrpGly: 0.664 ± 0.18
TrpHis: 0.078 ± 0.056
TrpIle: 0.547 ± 0.148
TrpLys: 0.821 ± 0.156
TrpLeu: 1.407 ± 0.245
TrpMet: 0.43 ± 0.155
TrpAsn: 1.094 ± 0.241
TrpPro: 0.235 ± 0.089
TrpGln: 0.586 ± 0.145
TrpArg: 0.664 ± 0.206
TrpSer: 0.625 ± 0.166
TrpThr: 0.743 ± 0.157
TrpVal: 0.586 ± 0.14
TrpTrp: 0.235 ± 0.091
TrpTyr: 0.664 ± 0.143
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.58 ± 0.345
TyrCys: 0.899 ± 0.169
TyrAsp: 2.697 ± 0.338
TyrGlu: 2.853 ± 0.306
TyrPhe: 1.681 ± 0.219
TyrGly: 2.775 ± 0.338
TyrHis: 0.86 ± 0.183
TyrIle: 2.15 ± 0.301
TyrLys: 3.205 ± 0.362
TyrLeu: 3.674 ± 0.324
TyrMet: 1.251 ± 0.202
TyrAsn: 2.462 ± 0.319
TyrPro: 1.563 ± 0.221
TyrGln: 1.485 ± 0.332
TyrArg: 1.798 ± 0.302
TyrSer: 2.462 ± 0.311
TyrThr: 3.635 ± 0.408
TyrVal: 3.01 ± 0.316
TyrTrp: 0.352 ± 0.12
TyrTyr: 1.876 ± 0.278
TyrXaa: 0.039 ± 0.04
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.039 ± 0.036
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.039 ± 0.041
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.078 ± 0.058
XaaXaa: 0.078 ± 0.082
Statistics based on 130 proteins (25585 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
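A protein-level bootstrap of this kind could look roughly like the sketch below. It is only an illustration under stated assumptions: the source does not say which statistic is reported after the ± sign, so the standard deviation across replicates is assumed here, and the function names `pair_per_mille` and `bootstrap_error` as well as the fixed seed are hypothetical. Sequences are given in one-letter code, so "AA" stands for AlaAla.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one residue pair (e.g. "AA") in per mille of all pairs."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs within a single protein only.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Spread of the pair frequency when whole proteins are resampled.

    Assumption: the error is the standard deviation over the replicates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_replicates):
        # Draw as many proteins as in the proteome, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.pstdev(estimates)


if __name__ == "__main__":
    toy_proteome = ["MAAKLV", "MKLAAG", "MGGAAS", "MKVLLT"]  # not real data
    print(pair_per_mille(toy_proteome, "AA"), bootstrap_error(toy_proteome, "AA"))
```

Resampling whole proteins rather than individual residues keeps the pairs within each sequence intact, so the estimated error reflects variation between proteins.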

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski