Amino acid dipepetide frequency for Escherichia phage St11Ph5

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.925AlaAla: 7.925 ± 1.123
0.851AlaCys: 0.851 ± 0.267
4.791AlaAsp: 4.791 ± 0.481
5.508AlaGlu: 5.508 ± 0.484
2.418AlaPhe: 2.418 ± 0.372
6.09AlaGly: 6.09 ± 0.672
1.299AlaHis: 1.299 ± 0.242
4.925AlaIle: 4.925 ± 0.496
5.687AlaLys: 5.687 ± 0.625
7.612AlaLeu: 7.612 ± 0.987
2.776AlaMet: 2.776 ± 0.445
5.105AlaAsn: 5.105 ± 0.46
2.284AlaPro: 2.284 ± 0.411
3.985AlaGln: 3.985 ± 0.804
3.851AlaArg: 3.851 ± 0.494
4.746AlaSer: 4.746 ± 0.422
5.597AlaThr: 5.597 ± 0.66
5.866AlaVal: 5.866 ± 0.518
0.851AlaTrp: 0.851 ± 0.177
3.627AlaTyr: 3.627 ± 0.354
0.0AlaXaa: 0.0 ± 0.0
Cys
0.537CysAla: 0.537 ± 0.165
0.09CysCys: 0.09 ± 0.062
0.493CysAsp: 0.493 ± 0.161
0.493CysGlu: 0.493 ± 0.158
0.493CysPhe: 0.493 ± 0.169
0.493CysGly: 0.493 ± 0.178
0.269CysHis: 0.269 ± 0.107
0.806CysIle: 0.806 ± 0.234
0.896CysLys: 0.896 ± 0.202
0.672CysLeu: 0.672 ± 0.214
0.313CysMet: 0.313 ± 0.135
0.627CysAsn: 0.627 ± 0.169
0.448CysPro: 0.448 ± 0.167
0.09CysGln: 0.09 ± 0.072
0.358CysArg: 0.358 ± 0.133
0.448CysSer: 0.448 ± 0.151
0.537CysThr: 0.537 ± 0.183
0.672CysVal: 0.672 ± 0.204
0.179CysTrp: 0.179 ± 0.082
0.493CysTyr: 0.493 ± 0.181
0.0CysXaa: 0.0 ± 0.0
Asp
5.105AspAla: 5.105 ± 0.688
0.896AspCys: 0.896 ± 0.245
3.716AspAsp: 3.716 ± 0.475
3.806AspGlu: 3.806 ± 0.526
2.373AspPhe: 2.373 ± 0.299
3.313AspGly: 3.313 ± 0.369
0.851AspHis: 0.851 ± 0.197
4.97AspIle: 4.97 ± 0.593
3.224AspLys: 3.224 ± 0.355
4.746AspLeu: 4.746 ± 0.524
1.657AspMet: 1.657 ± 0.279
2.015AspAsn: 2.015 ± 0.312
2.731AspPro: 2.731 ± 0.321
2.06AspGln: 2.06 ± 0.361
2.642AspArg: 2.642 ± 0.381
4.03AspSer: 4.03 ± 0.355
3.672AspThr: 3.672 ± 0.37
3.448AspVal: 3.448 ± 0.416
0.806AspTrp: 0.806 ± 0.186
2.463AspTyr: 2.463 ± 0.403
0.0AspXaa: 0.0 ± 0.0
Glu
6.851GluAla: 6.851 ± 0.877
0.582GluCys: 0.582 ± 0.198
4.343GluAsp: 4.343 ± 0.553
5.821GluGlu: 5.821 ± 0.685
2.687GluPhe: 2.687 ± 0.362
3.985GluGly: 3.985 ± 0.403
0.761GluHis: 0.761 ± 0.19
3.09GluIle: 3.09 ± 0.327
4.03GluLys: 4.03 ± 0.526
6.224GluLeu: 6.224 ± 0.495
2.328GluMet: 2.328 ± 0.285
2.821GluAsn: 2.821 ± 0.279
3.224GluPro: 3.224 ± 0.465
2.955GluGln: 2.955 ± 0.425
1.791GluArg: 1.791 ± 0.305
3.313GluSer: 3.313 ± 0.35
2.955GluThr: 2.955 ± 0.373
4.791GluVal: 4.791 ± 0.566
0.672GluTrp: 0.672 ± 0.16
2.552GluTyr: 2.552 ± 0.323
0.0GluXaa: 0.0 ± 0.0
Phe
2.642PheAla: 2.642 ± 0.379
0.403PheCys: 0.403 ± 0.151
2.284PheAsp: 2.284 ± 0.333
1.881PheGlu: 1.881 ± 0.272
1.119PhePhe: 1.119 ± 0.179
2.552PheGly: 2.552 ± 0.308
0.627PheHis: 0.627 ± 0.227
2.776PheIle: 2.776 ± 0.41
1.746PheLys: 1.746 ± 0.375
2.508PheLeu: 2.508 ± 0.408
1.567PheMet: 1.567 ± 0.25
2.418PheAsn: 2.418 ± 0.364
1.119PhePro: 1.119 ± 0.222
1.746PheGln: 1.746 ± 0.265
1.97PheArg: 1.97 ± 0.251
2.06PheSer: 2.06 ± 0.301
2.687PheThr: 2.687 ± 0.37
2.105PheVal: 2.105 ± 0.343
0.358PheTrp: 0.358 ± 0.132
1.478PheTyr: 1.478 ± 0.224
0.0PheXaa: 0.0 ± 0.0
Gly
4.97GlyAla: 4.97 ± 0.512
0.896GlyCys: 0.896 ± 0.261
3.134GlyAsp: 3.134 ± 0.394
3.94GlyGlu: 3.94 ± 0.489
2.687GlyPhe: 2.687 ± 0.41
3.985GlyGly: 3.985 ± 0.554
1.299GlyHis: 1.299 ± 0.257
3.94GlyIle: 3.94 ± 0.477
6.09GlyLys: 6.09 ± 0.544
5.194GlyLeu: 5.194 ± 0.576
2.328GlyMet: 2.328 ± 0.335
5.373GlyAsn: 5.373 ± 0.491
0.985GlyPro: 0.985 ± 0.228
2.328GlyGln: 2.328 ± 0.372
2.328GlyArg: 2.328 ± 0.371
4.657GlySer: 4.657 ± 0.382
4.657GlyThr: 4.657 ± 0.44
3.851GlyVal: 3.851 ± 0.52
0.94GlyTrp: 0.94 ± 0.203
2.552GlyTyr: 2.552 ± 0.298
0.0GlyXaa: 0.0 ± 0.0
His
1.254HisAla: 1.254 ± 0.231
0.224HisCys: 0.224 ± 0.101
1.209HisAsp: 1.209 ± 0.288
0.94HisGlu: 0.94 ± 0.215
0.537HisPhe: 0.537 ± 0.138
0.806HisGly: 0.806 ± 0.187
0.403HisHis: 0.403 ± 0.13
1.343HisIle: 1.343 ± 0.269
1.433HisLys: 1.433 ± 0.304
1.836HisLeu: 1.836 ± 0.292
0.448HisMet: 0.448 ± 0.133
0.806HisAsn: 0.806 ± 0.176
0.716HisPro: 0.716 ± 0.17
0.448HisGln: 0.448 ± 0.178
0.761HisArg: 0.761 ± 0.229
1.478HisSer: 1.478 ± 0.255
1.075HisThr: 1.075 ± 0.194
0.806HisVal: 0.806 ± 0.163
0.313HisTrp: 0.313 ± 0.132
0.985HisTyr: 0.985 ± 0.213
0.0HisXaa: 0.0 ± 0.0
Ile
4.925IleAla: 4.925 ± 0.549
0.403IleCys: 0.403 ± 0.144
3.761IleAsp: 3.761 ± 0.515
4.164IleGlu: 4.164 ± 0.331
1.97IlePhe: 1.97 ± 0.399
3.627IleGly: 3.627 ± 0.4
1.299IleHis: 1.299 ± 0.263
3.448IleIle: 3.448 ± 0.588
4.075IleLys: 4.075 ± 0.431
4.388IleLeu: 4.388 ± 0.531
1.299IleMet: 1.299 ± 0.201
3.582IleAsn: 3.582 ± 0.416
3.448IlePro: 3.448 ± 0.405
2.642IleGln: 2.642 ± 0.311
3.358IleArg: 3.358 ± 0.334
3.0IleSer: 3.0 ± 0.312
4.702IleThr: 4.702 ± 0.378
3.269IleVal: 3.269 ± 0.403
0.493IleTrp: 0.493 ± 0.193
2.149IleTyr: 2.149 ± 0.302
0.0IleXaa: 0.0 ± 0.0
Lys
6.627LysAla: 6.627 ± 0.735
0.537LysCys: 0.537 ± 0.187
3.537LysAsp: 3.537 ± 0.363
4.881LysGlu: 4.881 ± 0.582
2.239LysPhe: 2.239 ± 0.319
3.94LysGly: 3.94 ± 0.428
1.388LysHis: 1.388 ± 0.328
3.09LysIle: 3.09 ± 0.354
3.403LysLys: 3.403 ± 0.491
6.761LysLeu: 6.761 ± 0.535
1.612LysMet: 1.612 ± 0.284
3.179LysAsn: 3.179 ± 0.359
3.313LysPro: 3.313 ± 0.505
2.866LysGln: 2.866 ± 0.332
2.821LysArg: 2.821 ± 0.317
4.209LysSer: 4.209 ± 0.439
4.209LysThr: 4.209 ± 0.41
3.761LysVal: 3.761 ± 0.357
0.627LysTrp: 0.627 ± 0.198
2.149LysTyr: 2.149 ± 0.377
0.0LysXaa: 0.0 ± 0.0
Leu
7.657LeuAla: 7.657 ± 0.706
0.761LeuCys: 0.761 ± 0.203
4.702LeuAsp: 4.702 ± 0.458
4.388LeuGlu: 4.388 ± 0.441
2.91LeuPhe: 2.91 ± 0.311
6.314LeuGly: 6.314 ± 0.518
1.746LeuHis: 1.746 ± 0.299
4.567LeuIle: 4.567 ± 0.434
5.284LeuLys: 5.284 ± 0.479
6.269LeuLeu: 6.269 ± 0.619
2.597LeuMet: 2.597 ± 0.419
5.105LeuAsn: 5.105 ± 0.54
4.388LeuPro: 4.388 ± 0.488
3.0LeuGln: 3.0 ± 0.385
3.448LeuArg: 3.448 ± 0.382
5.149LeuSer: 5.149 ± 0.467
5.552LeuThr: 5.552 ± 0.489
5.642LeuVal: 5.642 ± 0.511
1.075LeuTrp: 1.075 ± 0.21
2.597LeuTyr: 2.597 ± 0.332
0.0LeuXaa: 0.0 ± 0.0
Met
2.508MetAla: 2.508 ± 0.431
0.179MetCys: 0.179 ± 0.095
1.343MetAsp: 1.343 ± 0.274
1.97MetGlu: 1.97 ± 0.334
0.448MetPhe: 0.448 ± 0.131
1.836MetGly: 1.836 ± 0.241
0.358MetHis: 0.358 ± 0.097
1.791MetIle: 1.791 ± 0.285
2.284MetLys: 2.284 ± 0.322
2.508MetLeu: 2.508 ± 0.344
0.806MetMet: 0.806 ± 0.168
2.284MetAsn: 2.284 ± 0.278
1.03MetPro: 1.03 ± 0.259
1.746MetGln: 1.746 ± 0.273
1.03MetArg: 1.03 ± 0.164
2.284MetSer: 2.284 ± 0.373
1.791MetThr: 1.791 ± 0.331
1.612MetVal: 1.612 ± 0.278
0.313MetTrp: 0.313 ± 0.096
0.806MetTyr: 0.806 ± 0.186
0.0MetXaa: 0.0 ± 0.0
Asn
4.612AsnAla: 4.612 ± 0.667
0.358AsnCys: 0.358 ± 0.126
2.687AsnAsp: 2.687 ± 0.302
3.313AsnGlu: 3.313 ± 0.415
2.06AsnPhe: 2.06 ± 0.361
4.164AsnGly: 4.164 ± 0.587
1.299AsnHis: 1.299 ± 0.25
3.313AsnIle: 3.313 ± 0.398
3.851AsnLys: 3.851 ± 0.522
4.343AsnLeu: 4.343 ± 0.517
1.343AsnMet: 1.343 ± 0.202
3.403AsnAsn: 3.403 ± 0.365
3.493AsnPro: 3.493 ± 0.382
3.09AsnGln: 3.09 ± 0.409
3.179AsnArg: 3.179 ± 0.3
3.493AsnSer: 3.493 ± 0.507
3.0AsnThr: 3.0 ± 0.501
3.448AsnVal: 3.448 ± 0.419
0.672AsnTrp: 0.672 ± 0.149
2.149AsnTyr: 2.149 ± 0.42
0.0AsnXaa: 0.0 ± 0.0
Pro
3.403ProAla: 3.403 ± 0.397
0.269ProCys: 0.269 ± 0.13
2.463ProAsp: 2.463 ± 0.45
3.94ProGlu: 3.94 ± 0.444
1.97ProPhe: 1.97 ± 0.27
2.508ProGly: 2.508 ± 0.285
0.313ProHis: 0.313 ± 0.132
2.105ProIle: 2.105 ± 0.303
1.791ProLys: 1.791 ± 0.319
2.91ProLeu: 2.91 ± 0.346
1.119ProMet: 1.119 ± 0.227
2.149ProAsn: 2.149 ± 0.291
0.896ProPro: 0.896 ± 0.243
1.702ProGln: 1.702 ± 0.279
1.299ProArg: 1.299 ± 0.258
2.328ProSer: 2.328 ± 0.292
3.313ProThr: 3.313 ± 0.344
3.94ProVal: 3.94 ± 0.355
0.493ProTrp: 0.493 ± 0.151
1.567ProTyr: 1.567 ± 0.317
0.0ProXaa: 0.0 ± 0.0
Gln
4.433GlnAla: 4.433 ± 0.62
0.224GlnCys: 0.224 ± 0.11
2.105GlnAsp: 2.105 ± 0.339
2.776GlnGlu: 2.776 ± 0.348
1.567GlnPhe: 1.567 ± 0.275
2.508GlnGly: 2.508 ± 0.42
0.493GlnHis: 0.493 ± 0.124
2.373GlnIle: 2.373 ± 0.3
2.866GlnLys: 2.866 ± 0.421
3.403GlnLeu: 3.403 ± 0.414
1.164GlnMet: 1.164 ± 0.22
2.015GlnAsn: 2.015 ± 0.432
1.03GlnPro: 1.03 ± 0.242
1.702GlnGln: 1.702 ± 0.335
1.522GlnArg: 1.522 ± 0.264
2.731GlnSer: 2.731 ± 0.351
2.328GlnThr: 2.328 ± 0.308
3.537GlnVal: 3.537 ± 0.346
0.627GlnTrp: 0.627 ± 0.147
1.836GlnTyr: 1.836 ± 0.22
0.0GlnXaa: 0.0 ± 0.0
Arg
3.716ArgAla: 3.716 ± 0.47
0.448ArgCys: 0.448 ± 0.156
2.508ArgAsp: 2.508 ± 0.337
2.955ArgGlu: 2.955 ± 0.454
1.522ArgPhe: 1.522 ± 0.257
2.508ArgGly: 2.508 ± 0.26
0.806ArgHis: 0.806 ± 0.225
3.134ArgIle: 3.134 ± 0.509
3.493ArgLys: 3.493 ± 0.447
3.985ArgLeu: 3.985 ± 0.5
1.433ArgMet: 1.433 ± 0.268
3.09ArgAsn: 3.09 ± 0.371
1.478ArgPro: 1.478 ± 0.265
1.657ArgGln: 1.657 ± 0.271
2.239ArgArg: 2.239 ± 0.334
2.463ArgSer: 2.463 ± 0.38
2.239ArgThr: 2.239 ± 0.284
2.687ArgVal: 2.687 ± 0.345
0.537ArgTrp: 0.537 ± 0.165
1.343ArgTyr: 1.343 ± 0.213
0.0ArgXaa: 0.0 ± 0.0
Ser
4.836SerAla: 4.836 ± 0.581
0.537SerCys: 0.537 ± 0.157
3.403SerAsp: 3.403 ± 0.363
4.03SerGlu: 4.03 ± 0.38
1.925SerPhe: 1.925 ± 0.332
5.105SerGly: 5.105 ± 0.51
0.94SerHis: 0.94 ± 0.246
3.358SerIle: 3.358 ± 0.39
3.403SerLys: 3.403 ± 0.424
6.493SerLeu: 6.493 ± 0.59
1.522SerMet: 1.522 ± 0.261
2.866SerAsn: 2.866 ± 0.311
2.418SerPro: 2.418 ± 0.44
1.925SerGln: 1.925 ± 0.381
2.866SerArg: 2.866 ± 0.345
3.224SerSer: 3.224 ± 0.533
4.522SerThr: 4.522 ± 0.588
4.343SerVal: 4.343 ± 0.536
0.627SerTrp: 0.627 ± 0.236
2.105SerTyr: 2.105 ± 0.364
0.0SerXaa: 0.0 ± 0.0
Thr
4.567ThrAla: 4.567 ± 0.515
0.403ThrCys: 0.403 ± 0.133
3.985ThrAsp: 3.985 ± 0.297
3.716ThrGlu: 3.716 ± 0.418
2.776ThrPhe: 2.776 ± 0.259
4.657ThrGly: 4.657 ± 0.536
0.985ThrHis: 0.985 ± 0.254
4.478ThrIle: 4.478 ± 0.391
3.985ThrLys: 3.985 ± 0.487
5.194ThrLeu: 5.194 ± 0.483
1.388ThrMet: 1.388 ± 0.286
3.716ThrAsn: 3.716 ± 0.454
3.134ThrPro: 3.134 ± 0.432
2.149ThrGln: 2.149 ± 0.295
2.508ThrArg: 2.508 ± 0.305
3.761ThrSer: 3.761 ± 0.523
3.09ThrThr: 3.09 ± 0.43
4.702ThrVal: 4.702 ± 0.533
0.716ThrTrp: 0.716 ± 0.174
2.597ThrTyr: 2.597 ± 0.384
0.0ThrXaa: 0.0 ± 0.0
Val
5.687ValAla: 5.687 ± 0.634
0.582ValCys: 0.582 ± 0.21
4.254ValAsp: 4.254 ± 0.498
4.388ValGlu: 4.388 ± 0.385
2.194ValPhe: 2.194 ± 0.307
4.388ValGly: 4.388 ± 0.561
1.702ValHis: 1.702 ± 0.241
3.403ValIle: 3.403 ± 0.479
4.254ValLys: 4.254 ± 0.488
4.343ValLeu: 4.343 ± 0.408
1.97ValMet: 1.97 ± 0.282
4.03ValAsn: 4.03 ± 0.459
2.552ValPro: 2.552 ± 0.499
3.09ValGln: 3.09 ± 0.371
4.075ValArg: 4.075 ± 0.359
4.119ValSer: 4.119 ± 0.471
4.343ValThr: 4.343 ± 0.494
5.149ValVal: 5.149 ± 0.562
0.672ValTrp: 0.672 ± 0.157
2.776ValTyr: 2.776 ± 0.44
0.0ValXaa: 0.0 ± 0.0
Trp
0.806TrpAla: 0.806 ± 0.197
0.179TrpCys: 0.179 ± 0.085
1.119TrpAsp: 1.119 ± 0.28
0.493TrpGlu: 0.493 ± 0.182
0.448TrpPhe: 0.448 ± 0.145
0.582TrpGly: 0.582 ± 0.146
0.269TrpHis: 0.269 ± 0.11
0.672TrpIle: 0.672 ± 0.162
0.672TrpLys: 0.672 ± 0.162
1.209TrpLeu: 1.209 ± 0.196
0.313TrpMet: 0.313 ± 0.12
0.403TrpAsn: 0.403 ± 0.177
0.313TrpPro: 0.313 ± 0.106
0.448TrpGln: 0.448 ± 0.13
0.537TrpArg: 0.537 ± 0.139
0.627TrpSer: 0.627 ± 0.164
0.493TrpThr: 0.493 ± 0.174
1.299TrpVal: 1.299 ± 0.216
0.045TrpTrp: 0.045 ± 0.042
0.448TrpTyr: 0.448 ± 0.166
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.731TyrAla: 2.731 ± 0.284
0.582TyrCys: 0.582 ± 0.183
2.687TyrAsp: 2.687 ± 0.259
2.373TyrGlu: 2.373 ± 0.349
1.702TyrPhe: 1.702 ± 0.327
2.776TyrGly: 2.776 ± 0.544
0.716TyrHis: 0.716 ± 0.162
2.418TyrIle: 2.418 ± 0.379
2.687TyrLys: 2.687 ± 0.378
2.552TyrLeu: 2.552 ± 0.263
0.896TyrMet: 0.896 ± 0.183
2.328TyrAsn: 2.328 ± 0.28
1.478TyrPro: 1.478 ± 0.299
1.612TyrGln: 1.612 ± 0.27
1.657TyrArg: 1.657 ± 0.39
2.328TyrSer: 2.328 ± 0.451
1.746TyrThr: 1.746 ± 0.271
3.045TyrVal: 3.045 ± 0.427
0.358TyrTrp: 0.358 ± 0.109
1.075TyrTyr: 1.075 ± 0.208
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 89 proteins (22334 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski