Amino acid dipeptide frequency for Escherichia phage Henu7

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
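As a rough illustration of what such a calculation involves, the sketch below counts overlapping adjacent residue pairs within each protein and reports them in per mille. It is a minimal sketch and not the database's own code: the function name dipeptide_frequencies, the one-letter toy sequences, and the choice to normalise by the total number of counted dipeptides are all assumptions made for the example.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Hypothetical helper: normalising by the total number of counted
    dipeptides is an assumption, not a documented Proteome-pI rule.
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs; a pair never spans two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input in one-letter code (the table itself uses three-letter codes).
toy = ["MKKA", "AKK"]
freqs = dipeptide_frequencies(toy)
```

Here the toy proteome contains five dipeptides (MK, KK, KA, AK, KK), so KK accounts for two of the five.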

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, with all 20 amino acids equally likely, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
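The expected value under this uniform model, and a simple over/under comparison against it, can be sketched as follows. The classify helper and its plain greater-than/less-than rule are assumptions for illustration; the page does not state the exact criterion used for the red and blue marking. The three sample values are taken from the table.

```python
N_STANDARD = 20  # standard amino acids; Xaa is ignored in this sketch

# Uniform expectation: 1 / (20 * 20) = 0.25 % = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / (N_STANDARD ** 2)


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide frequency relative to the uniform expectation.

    Hypothetical rule: a plain comparison with no tolerance band.
    """
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"


# Values copied from the table (per mille).
examples = {"AlaAla": 11.269, "CysCys": 0.075, "LysLeu": 5.0}
labels = {pair: classify(value) for pair, value in examples.items()}
```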

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
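For instance, the unit conversion can be written as below. The helper names are hypothetical, and the AlaAla value is the one listed in the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala = 11.269  # AlaAla frequency from the table, in per mille
fraction = per_mille_to_fraction(ala_ala)
percent = per_mille_to_percent(ala_ala)
```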

For more information, see the sequence space article on Wikipedia.

Each entry below reads FirstSecond: frequency ± error (‰). Entries are grouped by first residue; within each group the second residue runs in the order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 11.269 ± 2.087
AlaCys: 1.194 ± 0.325
AlaAsp: 4.851 ± 0.67
AlaGlu: 5.299 ± 0.835
AlaPhe: 2.761 ± 0.532
AlaGly: 6.045 ± 0.708
AlaHis: 0.97 ± 0.329
AlaIle: 5.448 ± 0.592
AlaLys: 7.015 ± 1.227
AlaLeu: 6.941 ± 0.658
AlaMet: 3.135 ± 0.482
AlaAsn: 4.329 ± 0.726
AlaPro: 2.314 ± 0.385
AlaGln: 4.03 ± 0.587
AlaArg: 4.776 ± 0.582
AlaSer: 7.463 ± 1.581
AlaThr: 4.478 ± 0.743
AlaVal: 5.597 ± 0.744
AlaTrp: 1.269 ± 0.246
AlaTyr: 2.388 ± 0.413
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 1.045 ± 0.32
CysCys: 0.075 ± 0.068
CysAsp: 1.269 ± 0.323
CysGlu: 1.119 ± 0.329
CysPhe: 0.299 ± 0.146
CysGly: 1.194 ± 0.293
CysHis: 0.373 ± 0.156
CysIle: 0.224 ± 0.108
CysLys: 0.821 ± 0.305
CysLeu: 0.373 ± 0.142
CysMet: 0.522 ± 0.195
CysAsn: 0.522 ± 0.169
CysPro: 0.299 ± 0.151
CysGln: 0.448 ± 0.181
CysArg: 0.597 ± 0.25
CysSer: 0.448 ± 0.185
CysThr: 0.821 ± 0.267
CysVal: 0.97 ± 0.287
CysTrp: 0.448 ± 0.183
CysTyr: 0.746 ± 0.198
CysXaa: 0.0 ± 0.0

Asp
AspAla: 4.329 ± 0.6
AspCys: 0.448 ± 0.164
AspAsp: 3.582 ± 0.616
AspGlu: 5.523 ± 0.695
AspPhe: 2.314 ± 0.39
AspGly: 6.941 ± 0.929
AspHis: 0.896 ± 0.371
AspIle: 3.284 ± 0.415
AspLys: 3.582 ± 0.416
AspLeu: 4.851 ± 0.539
AspMet: 1.717 ± 0.318
AspAsn: 2.09 ± 0.511
AspPro: 2.761 ± 0.408
AspGln: 2.314 ± 0.459
AspArg: 2.687 ± 0.451
AspSer: 3.358 ± 0.488
AspThr: 2.836 ± 0.551
AspVal: 4.254 ± 0.654
AspTrp: 1.194 ± 0.269
AspTyr: 2.463 ± 0.404
AspXaa: 0.0 ± 0.0

Glu
GluAla: 5.971 ± 0.625
GluCys: 1.119 ± 0.308
GluAsp: 3.135 ± 0.498
GluGlu: 4.776 ± 0.709
GluPhe: 3.582 ± 0.468
GluGly: 3.433 ± 0.513
GluHis: 0.746 ± 0.187
GluIle: 5.672 ± 0.638
GluLys: 3.806 ± 0.582
GluLeu: 4.403 ± 0.551
GluMet: 3.06 ± 0.502
GluAsn: 3.06 ± 0.446
GluPro: 1.717 ± 0.395
GluGln: 3.358 ± 0.636
GluArg: 3.956 ± 0.61
GluSer: 4.403 ± 0.548
GluThr: 3.956 ± 0.58
GluVal: 5.448 ± 0.843
GluTrp: 0.746 ± 0.205
GluTyr: 2.761 ± 0.411
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.015 ± 0.32
PheCys: 0.597 ± 0.255
PheAsp: 3.508 ± 0.509
PheGlu: 2.164 ± 0.465
PhePhe: 1.119 ± 0.29
PheGly: 3.582 ± 0.477
PheHis: 0.522 ± 0.226
PheIle: 1.791 ± 0.399
PheLys: 3.358 ± 0.469
PheLeu: 2.239 ± 0.424
PheMet: 1.343 ± 0.328
PheAsn: 2.538 ± 0.502
PhePro: 1.269 ± 0.303
PheGln: 1.791 ± 0.303
PheArg: 1.343 ± 0.288
PheSer: 2.09 ± 0.444
PheThr: 2.761 ± 0.433
PheVal: 2.463 ± 0.401
PheTrp: 0.597 ± 0.196
PheTyr: 0.896 ± 0.24
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 5.523 ± 0.81
GlyCys: 1.269 ± 0.332
GlyAsp: 4.254 ± 0.642
GlyGlu: 6.568 ± 0.708
GlyPhe: 2.761 ± 0.392
GlyGly: 6.418 ± 0.911
GlyHis: 0.672 ± 0.271
GlyIle: 4.03 ± 0.564
GlyLys: 5.896 ± 0.558
GlyLeu: 5.597 ± 0.551
GlyMet: 2.239 ± 0.318
GlyAsn: 3.657 ± 0.645
GlyPro: 0.0 ± 0.0
GlyGln: 2.314 ± 0.392
GlyArg: 2.911 ± 0.395
GlySer: 5.597 ± 0.638
GlyThr: 3.135 ± 0.585
GlyVal: 7.165 ± 0.718
GlyTrp: 1.343 ± 0.296
GlyTyr: 3.956 ± 0.6
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.896 ± 0.238
HisCys: 0.149 ± 0.087
HisAsp: 0.97 ± 0.293
HisGlu: 0.746 ± 0.231
HisPhe: 0.522 ± 0.19
HisGly: 1.119 ± 0.232
HisHis: 0.448 ± 0.237
HisIle: 1.791 ± 0.43
HisLys: 1.194 ± 0.341
HisLeu: 1.194 ± 0.365
HisMet: 0.299 ± 0.176
HisAsn: 0.746 ± 0.269
HisPro: 0.896 ± 0.324
HisGln: 0.373 ± 0.142
HisArg: 0.896 ± 0.245
HisSer: 0.522 ± 0.216
HisThr: 1.045 ± 0.291
HisVal: 0.896 ± 0.295
HisTrp: 0.224 ± 0.138
HisTyr: 0.672 ± 0.232
HisXaa: 0.0 ± 0.0

Ile
IleAla: 6.12 ± 0.609
IleCys: 1.194 ± 0.312
IleAsp: 4.702 ± 0.46
IleGlu: 4.03 ± 0.453
IlePhe: 2.09 ± 0.352
IleGly: 3.135 ± 0.404
IleHis: 1.343 ± 0.37
IleIle: 3.135 ± 0.465
IleLys: 4.478 ± 0.612
IleLeu: 3.657 ± 0.569
IleMet: 1.94 ± 0.415
IleAsn: 3.433 ± 0.458
IlePro: 2.388 ± 0.378
IleGln: 2.239 ± 0.408
IleArg: 3.433 ± 0.552
IleSer: 4.105 ± 0.46
IleThr: 4.03 ± 0.494
IleVal: 3.209 ± 0.368
IleTrp: 1.194 ± 0.383
IleTyr: 1.866 ± 0.335
IleXaa: 0.0 ± 0.0

Lys
LysAla: 6.866 ± 0.849
LysCys: 0.746 ± 0.24
LysAsp: 4.627 ± 0.567
LysGlu: 5.896 ± 0.608
LysPhe: 2.09 ± 0.376
LysGly: 4.329 ± 0.525
LysHis: 1.567 ± 0.36
LysIle: 3.806 ± 0.55
LysLys: 3.732 ± 0.455
LysLeu: 5.0 ± 0.621
LysMet: 2.836 ± 0.435
LysAsn: 2.911 ± 0.456
LysPro: 2.911 ± 0.544
LysGln: 2.538 ± 0.449
LysArg: 3.881 ± 0.623
LysSer: 3.433 ± 0.404
LysThr: 4.553 ± 0.782
LysVal: 4.478 ± 0.63
LysTrp: 0.97 ± 0.237
LysTyr: 1.717 ± 0.308
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 7.165 ± 0.958
LeuCys: 0.672 ± 0.212
LeuAsp: 4.179 ± 0.469
LeuGlu: 4.03 ± 0.546
LeuPhe: 1.866 ± 0.362
LeuGly: 4.03 ± 0.614
LeuHis: 0.821 ± 0.322
LeuIle: 4.702 ± 0.557
LeuLys: 4.851 ± 0.679
LeuLeu: 2.985 ± 0.444
LeuMet: 1.94 ± 0.388
LeuAsn: 3.732 ± 0.512
LeuPro: 3.06 ± 0.534
LeuGln: 2.09 ± 0.336
LeuArg: 4.105 ± 0.573
LeuSer: 4.627 ± 0.557
LeuThr: 4.776 ± 0.602
LeuVal: 4.105 ± 0.454
LeuTrp: 1.119 ± 0.341
LeuTyr: 1.94 ± 0.341
LeuXaa: 0.0 ± 0.0

Met
MetAla: 3.135 ± 0.491
MetCys: 0.373 ± 0.144
MetAsp: 1.418 ± 0.334
MetGlu: 2.164 ± 0.36
MetPhe: 1.269 ± 0.352
MetGly: 1.493 ± 0.324
MetHis: 0.597 ± 0.182
MetIle: 2.761 ± 0.476
MetLys: 2.687 ± 0.448
MetLeu: 2.164 ± 0.45
MetMet: 1.194 ± 0.333
MetAsn: 0.896 ± 0.245
MetPro: 1.418 ± 0.286
MetGln: 1.717 ± 0.379
MetArg: 1.94 ± 0.371
MetSer: 2.015 ± 0.348
MetThr: 1.642 ± 0.355
MetVal: 1.866 ± 0.313
MetTrp: 0.373 ± 0.15
MetTyr: 0.821 ± 0.211
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 4.403 ± 0.713
AsnCys: 0.373 ± 0.163
AsnAsp: 2.164 ± 0.38
AsnGlu: 2.985 ± 0.416
AsnPhe: 1.567 ± 0.24
AsnGly: 5.224 ± 0.716
AsnHis: 0.522 ± 0.174
AsnIle: 3.209 ± 0.369
AsnLys: 3.209 ± 0.476
AsnLeu: 3.135 ± 0.691
AsnMet: 1.194 ± 0.326
AsnAsn: 2.015 ± 0.418
AsnPro: 2.239 ± 0.37
AsnGln: 1.418 ± 0.24
AsnArg: 1.567 ± 0.298
AsnSer: 2.985 ± 0.451
AsnThr: 1.343 ± 0.271
AsnVal: 3.433 ± 0.581
AsnTrp: 0.821 ± 0.244
AsnTyr: 1.119 ± 0.338
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 2.09 ± 0.335
ProCys: 0.448 ± 0.255
ProAsp: 2.761 ± 0.485
ProGlu: 2.911 ± 0.608
ProPhe: 1.418 ± 0.306
ProGly: 2.612 ± 0.446
ProHis: 0.522 ± 0.21
ProIle: 2.015 ± 0.477
ProLys: 1.567 ± 0.259
ProLeu: 2.164 ± 0.364
ProMet: 0.746 ± 0.294
ProAsn: 1.642 ± 0.311
ProPro: 1.045 ± 0.269
ProGln: 1.194 ± 0.295
ProArg: 1.866 ± 0.419
ProSer: 1.418 ± 0.326
ProThr: 1.567 ± 0.304
ProVal: 2.761 ± 0.451
ProTrp: 0.448 ± 0.163
ProTyr: 1.418 ± 0.339
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 4.254 ± 0.598
GlnCys: 0.448 ± 0.187
GlnAsp: 1.866 ± 0.37
GlnGlu: 1.94 ± 0.309
GlnPhe: 0.97 ± 0.232
GlnGly: 2.538 ± 0.337
GlnHis: 0.672 ± 0.253
GlnIle: 2.538 ± 0.475
GlnLys: 2.463 ± 0.489
GlnLeu: 2.239 ± 0.507
GlnMet: 1.119 ± 0.235
GlnAsn: 1.269 ± 0.322
GlnPro: 1.194 ± 0.333
GlnGln: 2.687 ± 0.601
GlnArg: 3.06 ± 0.479
GlnSer: 2.463 ± 0.47
GlnThr: 2.09 ± 0.352
GlnVal: 3.135 ± 0.405
GlnTrp: 0.746 ± 0.287
GlnTyr: 2.09 ± 0.447
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 4.627 ± 0.606
ArgCys: 0.896 ± 0.359
ArgAsp: 2.314 ± 0.345
ArgGlu: 3.806 ± 0.657
ArgPhe: 2.687 ± 0.355
ArgGly: 3.358 ± 0.437
ArgHis: 0.299 ± 0.174
ArgIle: 3.209 ± 0.5
ArgLys: 4.553 ± 0.673
ArgLeu: 3.209 ± 0.373
ArgMet: 1.791 ± 0.365
ArgAsn: 2.164 ± 0.479
ArgPro: 1.418 ± 0.369
ArgGln: 1.866 ± 0.376
ArgArg: 2.985 ± 0.469
ArgSer: 2.985 ± 0.516
ArgThr: 2.015 ± 0.402
ArgVal: 4.478 ± 0.654
ArgTrp: 0.821 ± 0.246
ArgTyr: 1.791 ± 0.292
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 7.314 ± 1.361
SerCys: 0.299 ± 0.164
SerAsp: 3.657 ± 0.399
SerGlu: 4.553 ± 0.611
SerPhe: 3.209 ± 0.419
SerGly: 6.493 ± 0.584
SerHis: 1.418 ± 0.427
SerIle: 3.284 ± 0.734
SerLys: 2.538 ± 0.389
SerLeu: 5.299 ± 0.522
SerMet: 1.194 ± 0.317
SerAsn: 2.538 ± 0.608
SerPro: 1.269 ± 0.294
SerGln: 2.612 ± 0.504
SerArg: 2.687 ± 0.413
SerSer: 4.105 ± 0.809
SerThr: 3.657 ± 0.499
SerVal: 4.254 ± 0.512
SerTrp: 0.821 ± 0.308
SerTyr: 2.09 ± 0.467
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 5.299 ± 0.709
ThrCys: 0.597 ± 0.229
ThrAsp: 2.687 ± 0.498
ThrGlu: 2.985 ± 0.525
ThrPhe: 2.164 ± 0.401
ThrGly: 6.045 ± 0.571
ThrHis: 0.597 ± 0.197
ThrIle: 3.135 ± 0.479
ThrLys: 3.732 ± 0.592
ThrLeu: 3.956 ± 0.448
ThrMet: 1.642 ± 0.345
ThrAsn: 3.06 ± 0.502
ThrPro: 2.538 ± 0.358
ThrGln: 2.538 ± 0.484
ThrArg: 1.717 ± 0.489
ThrSer: 3.806 ± 0.592
ThrThr: 2.239 ± 0.324
ThrVal: 3.06 ± 0.424
ThrTrp: 0.746 ± 0.206
ThrTyr: 2.687 ± 0.546
ThrXaa: 0.0 ± 0.0

Val
ValAla: 5.374 ± 0.703
ValCys: 1.194 ± 0.282
ValAsp: 5.747 ± 0.867
ValGlu: 4.702 ± 0.599
ValPhe: 3.657 ± 0.623
ValGly: 3.582 ± 0.418
ValHis: 1.418 ± 0.318
ValIle: 5.15 ± 0.609
ValLys: 5.523 ± 0.658
ValLeu: 4.403 ± 0.553
ValMet: 2.612 ± 0.423
ValAsn: 2.463 ± 0.43
ValPro: 2.164 ± 0.431
ValGln: 2.388 ± 0.469
ValArg: 3.657 ± 0.509
ValSer: 4.179 ± 0.956
ValThr: 4.179 ± 0.41
ValVal: 4.776 ± 0.615
ValTrp: 0.746 ± 0.252
ValTyr: 1.717 ± 0.353
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.045 ± 0.262
TrpCys: 0.149 ± 0.099
TrpAsp: 1.045 ± 0.238
TrpGlu: 0.746 ± 0.293
TrpPhe: 0.597 ± 0.324
TrpGly: 1.045 ± 0.254
TrpHis: 0.522 ± 0.163
TrpIle: 0.97 ± 0.283
TrpLys: 1.642 ± 0.31
TrpLeu: 1.119 ± 0.281
TrpMet: 0.448 ± 0.2
TrpAsn: 0.373 ± 0.213
TrpPro: 0.299 ± 0.191
TrpGln: 0.448 ± 0.175
TrpArg: 1.269 ± 0.288
TrpSer: 0.97 ± 0.325
TrpThr: 1.045 ± 0.29
TrpVal: 0.896 ± 0.228
TrpTrp: 0.075 ± 0.08
TrpTyr: 0.597 ± 0.209
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.836 ± 0.434
TyrCys: 0.373 ± 0.162
TyrAsp: 2.687 ± 0.487
TyrGlu: 2.164 ± 0.475
TyrPhe: 1.045 ± 0.236
TyrGly: 2.687 ± 0.507
TyrHis: 0.672 ± 0.194
TyrIle: 1.567 ± 0.288
TyrLys: 2.239 ± 0.468
TyrLeu: 1.866 ± 0.314
TyrMet: 0.97 ± 0.268
TyrAsn: 1.493 ± 0.393
TyrPro: 1.418 ± 0.32
TyrGln: 1.418 ± 0.313
TyrArg: 1.94 ± 0.415
TyrSer: 2.314 ± 0.504
TyrThr: 3.06 ± 0.468
TyrVal: 2.388 ± 0.399
TyrTrp: 0.597 ± 0.197
TyrTyr: 1.269 ± 0.32
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 67 proteins (13400 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
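A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of those estimates is reported. This is an assumption-laden illustration, not the database's implementation: the function names, the fixed seed, the toy one-letter sequences, and the use of the sample standard deviation as the error are all choices made here.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, pair):
    """Frequency of one dipeptide (per mille) over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Spread of the frequency across protein-level bootstrap replicates.

    Assumption: the error is taken as the sample standard deviation.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, pair))
    return statistics.stdev(estimates)


toy = ["MKKAL", "AKKG", "MALKK", "GGAL"]  # hypothetical toy proteome
err = bootstrap_error(toy, "KK")
```

Resampling proteins rather than individual residues keeps each protein's internal composition intact, so the estimate reflects variation between proteins.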

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski