Amino acid dipepetide frequency for Escherichia phage HZ2R8

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.239AlaAla: 9.239 ± 1.627
0.628AlaCys: 0.628 ± 0.261
5.562AlaAsp: 5.562 ± 0.617
5.382AlaGlu: 5.382 ± 0.733
2.96AlaPhe: 2.96 ± 0.506
7.535AlaGly: 7.535 ± 0.782
1.704AlaHis: 1.704 ± 0.296
5.203AlaIle: 5.203 ± 0.956
6.997AlaLys: 6.997 ± 0.728
9.06AlaLeu: 9.06 ± 1.102
2.87AlaMet: 2.87 ± 0.518
4.485AlaAsn: 4.485 ± 0.585
2.601AlaPro: 2.601 ± 0.384
4.037AlaGln: 4.037 ± 0.778
6.189AlaArg: 6.189 ± 0.576
5.203AlaSer: 5.203 ± 0.834
3.857AlaThr: 3.857 ± 0.554
4.844AlaVal: 4.844 ± 0.617
1.704AlaTrp: 1.704 ± 0.418
2.781AlaTyr: 2.781 ± 0.453
0.0AlaXaa: 0.0 ± 0.0
Cys
0.718CysAla: 0.718 ± 0.29
0.09CysCys: 0.09 ± 0.108
1.435CysAsp: 1.435 ± 0.482
0.449CysGlu: 0.449 ± 0.228
0.718CysPhe: 0.718 ± 0.35
0.807CysGly: 0.807 ± 0.364
0.718CysHis: 0.718 ± 0.275
0.718CysIle: 0.718 ± 0.289
0.538CysLys: 0.538 ± 0.219
0.538CysLeu: 0.538 ± 0.238
0.179CysMet: 0.179 ± 0.141
0.359CysAsn: 0.359 ± 0.181
0.449CysPro: 0.449 ± 0.192
0.538CysGln: 0.538 ± 0.213
0.449CysArg: 0.449 ± 0.204
1.166CysSer: 1.166 ± 0.561
0.09CysThr: 0.09 ± 0.108
0.718CysVal: 0.718 ± 0.32
0.359CysTrp: 0.359 ± 0.211
0.359CysTyr: 0.359 ± 0.173
0.0CysXaa: 0.0 ± 0.0
Asp
5.382AspAla: 5.382 ± 0.821
0.538AspCys: 0.538 ± 0.211
4.216AspAsp: 4.216 ± 0.759
4.037AspGlu: 4.037 ± 0.564
3.05AspPhe: 3.05 ± 0.401
6.279AspGly: 6.279 ± 0.63
0.628AspHis: 0.628 ± 0.236
3.14AspIle: 3.14 ± 0.379
4.485AspLys: 4.485 ± 0.617
4.306AspLeu: 4.306 ± 0.756
2.512AspMet: 2.512 ± 0.563
3.14AspAsn: 3.14 ± 0.539
3.319AspPro: 3.319 ± 0.602
1.704AspGln: 1.704 ± 0.477
2.691AspArg: 2.691 ± 0.535
4.395AspSer: 4.395 ± 0.628
3.498AspThr: 3.498 ± 0.496
4.485AspVal: 4.485 ± 0.501
0.897AspTrp: 0.897 ± 0.293
1.435AspTyr: 1.435 ± 0.375
0.0AspXaa: 0.0 ± 0.0
Glu
7.086GluAla: 7.086 ± 1.079
0.449GluCys: 0.449 ± 0.22
3.947GluAsp: 3.947 ± 0.708
7.176GluGlu: 7.176 ± 0.938
2.063GluPhe: 2.063 ± 0.451
5.292GluGly: 5.292 ± 0.821
1.525GluHis: 1.525 ± 0.383
3.498GluIle: 3.498 ± 0.43
3.409GluLys: 3.409 ± 0.594
6.548GluLeu: 6.548 ± 0.614
2.153GluMet: 2.153 ± 0.552
3.319GluAsn: 3.319 ± 0.478
1.525GluPro: 1.525 ± 0.459
3.767GluGln: 3.767 ± 1.03
4.037GluArg: 4.037 ± 0.582
5.113GluSer: 5.113 ± 0.836
3.857GluThr: 3.857 ± 0.515
4.395GluVal: 4.395 ± 0.862
1.076GluTrp: 1.076 ± 0.361
3.409GluTyr: 3.409 ± 0.664
0.0GluXaa: 0.0 ± 0.0
Phe
1.794PheAla: 1.794 ± 0.417
0.538PheCys: 0.538 ± 0.208
2.691PheAsp: 2.691 ± 0.429
2.601PheGlu: 2.601 ± 0.417
0.897PhePhe: 0.897 ± 0.235
3.409PheGly: 3.409 ± 0.518
0.538PheHis: 0.538 ± 0.319
1.884PheIle: 1.884 ± 0.512
2.243PheLys: 2.243 ± 0.464
2.87PheLeu: 2.87 ± 0.506
1.525PheMet: 1.525 ± 0.288
1.615PheAsn: 1.615 ± 0.444
1.435PhePro: 1.435 ± 0.478
1.076PheGln: 1.076 ± 0.272
2.063PheArg: 2.063 ± 0.385
1.794PheSer: 1.794 ± 0.468
2.422PheThr: 2.422 ± 0.379
1.615PheVal: 1.615 ± 0.398
0.269PheTrp: 0.269 ± 0.126
1.166PheTyr: 1.166 ± 0.382
0.0PheXaa: 0.0 ± 0.0
Gly
6.01GlyAla: 6.01 ± 0.953
1.076GlyCys: 1.076 ± 0.361
4.306GlyAsp: 4.306 ± 0.747
5.023GlyGlu: 5.023 ± 0.633
3.857GlyPhe: 3.857 ± 0.656
5.382GlyGly: 5.382 ± 0.746
1.525GlyHis: 1.525 ± 0.41
4.306GlyIle: 4.306 ± 0.7
5.562GlyLys: 5.562 ± 0.937
5.472GlyLeu: 5.472 ± 0.947
2.332GlyMet: 2.332 ± 0.493
2.781GlyAsn: 2.781 ± 0.521
0.897GlyPro: 0.897 ± 0.317
2.422GlyGln: 2.422 ± 0.434
5.292GlyArg: 5.292 ± 0.581
4.754GlySer: 4.754 ± 0.762
3.947GlyThr: 3.947 ± 0.655
4.485GlyVal: 4.485 ± 0.511
1.525GlyTrp: 1.525 ± 0.515
2.691GlyTyr: 2.691 ± 0.438
0.0GlyXaa: 0.0 ± 0.0
His
1.704HisAla: 1.704 ± 0.444
0.359HisCys: 0.359 ± 0.19
0.987HisAsp: 0.987 ± 0.269
1.794HisGlu: 1.794 ± 0.549
0.718HisPhe: 0.718 ± 0.276
0.987HisGly: 0.987 ± 0.273
0.628HisHis: 0.628 ± 0.274
1.346HisIle: 1.346 ± 0.35
1.346HisLys: 1.346 ± 0.328
2.332HisLeu: 2.332 ± 0.542
0.359HisMet: 0.359 ± 0.151
0.538HisAsn: 0.538 ± 0.235
0.359HisPro: 0.359 ± 0.182
0.179HisGln: 0.179 ± 0.109
0.987HisArg: 0.987 ± 0.212
1.076HisSer: 1.076 ± 0.247
0.538HisThr: 0.538 ± 0.237
1.256HisVal: 1.256 ± 0.288
0.269HisTrp: 0.269 ± 0.154
0.987HisTyr: 0.987 ± 0.265
0.0HisXaa: 0.0 ± 0.0
Ile
4.126IleAla: 4.126 ± 0.475
0.628IleCys: 0.628 ± 0.184
3.857IleAsp: 3.857 ± 0.56
4.037IleGlu: 4.037 ± 0.754
0.807IlePhe: 0.807 ± 0.211
3.947IleGly: 3.947 ± 0.553
1.076IleHis: 1.076 ± 0.408
3.678IleIle: 3.678 ± 0.64
3.409IleLys: 3.409 ± 0.609
3.767IleLeu: 3.767 ± 0.515
1.346IleMet: 1.346 ± 0.369
3.588IleAsn: 3.588 ± 0.694
2.512IlePro: 2.512 ± 0.366
1.615IleGln: 1.615 ± 0.49
3.319IleArg: 3.319 ± 0.517
2.781IleSer: 2.781 ± 0.499
2.87IleThr: 2.87 ± 0.375
2.781IleVal: 2.781 ± 0.411
0.628IleTrp: 0.628 ± 0.207
1.794IleTyr: 1.794 ± 0.387
0.0IleXaa: 0.0 ± 0.0
Lys
8.342LysAla: 8.342 ± 1.172
0.628LysCys: 0.628 ± 0.256
3.588LysAsp: 3.588 ± 0.491
4.665LysGlu: 4.665 ± 0.738
1.794LysPhe: 1.794 ± 0.281
4.665LysGly: 4.665 ± 0.732
1.525LysHis: 1.525 ± 0.384
2.332LysIle: 2.332 ± 0.45
4.306LysLys: 4.306 ± 0.909
5.023LysLeu: 5.023 ± 0.867
1.884LysMet: 1.884 ± 0.459
2.332LysAsn: 2.332 ± 0.315
2.87LysPro: 2.87 ± 0.617
2.512LysGln: 2.512 ± 0.58
4.216LysArg: 4.216 ± 0.762
3.678LysSer: 3.678 ± 0.577
3.229LysThr: 3.229 ± 0.503
3.857LysVal: 3.857 ± 0.662
0.718LysTrp: 0.718 ± 0.267
2.781LysTyr: 2.781 ± 0.441
0.0LysXaa: 0.0 ± 0.0
Leu
9.06LeuAla: 9.06 ± 1.075
0.628LeuCys: 0.628 ± 0.32
4.754LeuAsp: 4.754 ± 0.556
6.01LeuGlu: 6.01 ± 0.82
2.422LeuPhe: 2.422 ± 0.427
4.216LeuGly: 4.216 ± 0.631
1.076LeuHis: 1.076 ± 0.314
3.678LeuIle: 3.678 ± 0.489
5.292LeuLys: 5.292 ± 0.659
5.203LeuLeu: 5.203 ± 0.891
2.422LeuMet: 2.422 ± 0.438
4.395LeuAsn: 4.395 ± 0.569
3.05LeuPro: 3.05 ± 0.555
3.947LeuGln: 3.947 ± 0.61
6.189LeuArg: 6.189 ± 0.756
4.844LeuSer: 4.844 ± 0.714
4.575LeuThr: 4.575 ± 0.672
4.306LeuVal: 4.306 ± 0.618
1.256LeuTrp: 1.256 ± 0.445
3.05LeuTyr: 3.05 ± 0.616
0.0LeuXaa: 0.0 ± 0.0
Met
3.14MetAla: 3.14 ± 0.521
0.359MetCys: 0.359 ± 0.177
2.422MetAsp: 2.422 ± 0.353
2.063MetGlu: 2.063 ± 0.436
0.987MetPhe: 0.987 ± 0.357
2.243MetGly: 2.243 ± 0.393
0.628MetHis: 0.628 ± 0.201
1.256MetIle: 1.256 ± 0.362
1.076MetLys: 1.076 ± 0.202
3.498MetLeu: 3.498 ± 0.541
0.807MetMet: 0.807 ± 0.24
1.256MetAsn: 1.256 ± 0.379
1.346MetPro: 1.346 ± 0.29
1.346MetGln: 1.346 ± 0.278
1.435MetArg: 1.435 ± 0.396
1.435MetSer: 1.435 ± 0.461
1.615MetThr: 1.615 ± 0.375
2.691MetVal: 2.691 ± 0.501
0.0MetTrp: 0.0 ± 0.0
0.807MetTyr: 0.807 ± 0.338
0.0MetXaa: 0.0 ± 0.0
Asn
3.498AsnAla: 3.498 ± 0.435
0.807AsnCys: 0.807 ± 0.222
2.781AsnAsp: 2.781 ± 0.486
3.05AsnGlu: 3.05 ± 0.483
1.525AsnPhe: 1.525 ± 0.323
4.126AsnGly: 4.126 ± 0.687
0.987AsnHis: 0.987 ± 0.355
2.87AsnIle: 2.87 ± 0.517
2.601AsnLys: 2.601 ± 0.483
3.05AsnLeu: 3.05 ± 0.567
1.166AsnMet: 1.166 ± 0.338
1.794AsnAsn: 1.794 ± 0.461
2.87AsnPro: 2.87 ± 0.567
1.884AsnGln: 1.884 ± 0.451
2.332AsnArg: 2.332 ± 0.634
2.96AsnSer: 2.96 ± 0.702
2.601AsnThr: 2.601 ± 0.706
3.498AsnVal: 3.498 ± 0.895
0.359AsnTrp: 0.359 ± 0.167
1.346AsnTyr: 1.346 ± 0.331
0.0AsnXaa: 0.0 ± 0.0
Pro
3.229ProAla: 3.229 ± 0.331
0.628ProCys: 0.628 ± 0.247
2.781ProAsp: 2.781 ± 0.579
3.947ProGlu: 3.947 ± 0.761
1.435ProPhe: 1.435 ± 0.341
0.718ProGly: 0.718 ± 0.234
0.718ProHis: 0.718 ± 0.214
1.615ProIle: 1.615 ± 0.313
2.332ProLys: 2.332 ± 0.466
2.601ProLeu: 2.601 ± 0.509
1.525ProMet: 1.525 ± 0.37
2.243ProAsn: 2.243 ± 0.41
0.628ProPro: 0.628 ± 0.216
1.256ProGln: 1.256 ± 0.325
1.794ProArg: 1.794 ± 0.444
2.332ProSer: 2.332 ± 0.432
1.973ProThr: 1.973 ± 0.457
2.063ProVal: 2.063 ± 0.279
0.718ProTrp: 0.718 ± 0.218
1.166ProTyr: 1.166 ± 0.384
0.0ProXaa: 0.0 ± 0.0
Gln
3.588GlnAla: 3.588 ± 1.018
0.269GlnCys: 0.269 ± 0.172
1.704GlnAsp: 1.704 ± 0.382
3.319GlnGlu: 3.319 ± 0.635
2.063GlnPhe: 2.063 ± 0.322
2.422GlnGly: 2.422 ± 0.457
0.359GlnHis: 0.359 ± 0.198
1.704GlnIle: 1.704 ± 0.44
2.332GlnLys: 2.332 ± 0.42
4.306GlnLeu: 4.306 ± 0.46
1.256GlnMet: 1.256 ± 0.352
1.256GlnAsn: 1.256 ± 0.335
1.525GlnPro: 1.525 ± 0.415
1.884GlnGln: 1.884 ± 0.367
2.063GlnArg: 2.063 ± 0.591
2.243GlnSer: 2.243 ± 0.503
1.615GlnThr: 1.615 ± 0.43
2.153GlnVal: 2.153 ± 0.518
1.076GlnTrp: 1.076 ± 0.319
1.256GlnTyr: 1.256 ± 0.407
0.0GlnXaa: 0.0 ± 0.0
Arg
5.831ArgAla: 5.831 ± 0.537
1.256ArgCys: 1.256 ± 0.47
4.037ArgAsp: 4.037 ± 0.545
4.485ArgGlu: 4.485 ± 0.791
2.063ArgPhe: 2.063 ± 0.405
3.498ArgGly: 3.498 ± 0.426
0.987ArgHis: 0.987 ± 0.389
3.498ArgIle: 3.498 ± 0.428
4.126ArgLys: 4.126 ± 0.64
4.934ArgLeu: 4.934 ± 0.647
1.525ArgMet: 1.525 ± 0.371
2.332ArgAsn: 2.332 ± 0.583
2.153ArgPro: 2.153 ± 0.353
1.973ArgGln: 1.973 ± 0.486
3.409ArgArg: 3.409 ± 0.504
3.319ArgSer: 3.319 ± 0.466
2.601ArgThr: 2.601 ± 0.388
3.678ArgVal: 3.678 ± 0.507
0.987ArgTrp: 0.987 ± 0.252
1.794ArgTyr: 1.794 ± 0.506
0.0ArgXaa: 0.0 ± 0.0
Ser
5.92SerAla: 5.92 ± 0.684
0.718SerCys: 0.718 ± 0.318
5.113SerAsp: 5.113 ± 0.626
3.409SerGlu: 3.409 ± 0.496
2.332SerPhe: 2.332 ± 0.48
5.562SerGly: 5.562 ± 1.01
1.076SerHis: 1.076 ± 0.308
3.05SerIle: 3.05 ± 0.602
3.588SerLys: 3.588 ± 0.617
3.857SerLeu: 3.857 ± 0.434
1.884SerMet: 1.884 ± 0.457
2.691SerAsn: 2.691 ± 0.619
1.525SerPro: 1.525 ± 0.364
2.153SerGln: 2.153 ± 0.522
2.96SerArg: 2.96 ± 0.437
3.229SerSer: 3.229 ± 0.711
3.14SerThr: 3.14 ± 0.691
4.037SerVal: 4.037 ± 0.516
0.987SerTrp: 0.987 ± 0.279
2.243SerTyr: 2.243 ± 0.5
0.0SerXaa: 0.0 ± 0.0
Thr
4.844ThrAla: 4.844 ± 0.621
0.897ThrCys: 0.897 ± 0.346
3.588ThrAsp: 3.588 ± 0.396
3.678ThrGlu: 3.678 ± 0.632
1.615ThrPhe: 1.615 ± 0.546
5.292ThrGly: 5.292 ± 0.655
0.987ThrHis: 0.987 ± 0.217
3.588ThrIle: 3.588 ± 0.577
4.395ThrLys: 4.395 ± 0.693
4.485ThrLeu: 4.485 ± 0.591
1.525ThrMet: 1.525 ± 0.424
1.884ThrAsn: 1.884 ± 0.474
2.512ThrPro: 2.512 ± 0.397
2.063ThrGln: 2.063 ± 0.387
2.332ThrArg: 2.332 ± 0.349
2.87ThrSer: 2.87 ± 0.557
2.87ThrThr: 2.87 ± 0.469
3.409ThrVal: 3.409 ± 0.635
0.359ThrTrp: 0.359 ± 0.175
0.987ThrTyr: 0.987 ± 0.314
0.0ThrXaa: 0.0 ± 0.0
Val
5.651ValAla: 5.651 ± 0.727
0.449ValCys: 0.449 ± 0.2
3.229ValAsp: 3.229 ± 0.529
5.113ValGlu: 5.113 ± 0.764
1.884ValPhe: 1.884 ± 0.49
3.947ValGly: 3.947 ± 0.654
1.166ValHis: 1.166 ± 0.344
2.96ValIle: 2.96 ± 0.624
3.947ValLys: 3.947 ± 0.697
4.216ValLeu: 4.216 ± 0.534
1.704ValMet: 1.704 ± 0.309
2.96ValAsn: 2.96 ± 0.521
2.781ValPro: 2.781 ± 0.441
2.063ValGln: 2.063 ± 0.437
4.037ValArg: 4.037 ± 0.588
3.14ValSer: 3.14 ± 0.557
5.292ValThr: 5.292 ± 0.727
4.306ValVal: 4.306 ± 0.644
0.987ValTrp: 0.987 ± 0.31
2.422ValTyr: 2.422 ± 0.546
0.0ValXaa: 0.0 ± 0.0
Trp
0.628TrpAla: 0.628 ± 0.286
0.179TrpCys: 0.179 ± 0.14
0.449TrpAsp: 0.449 ± 0.19
0.897TrpGlu: 0.897 ± 0.295
0.538TrpPhe: 0.538 ± 0.213
0.807TrpGly: 0.807 ± 0.297
0.449TrpHis: 0.449 ± 0.276
0.718TrpIle: 0.718 ± 0.299
1.076TrpLys: 1.076 ± 0.356
1.435TrpLeu: 1.435 ± 0.382
0.449TrpMet: 0.449 ± 0.201
1.435TrpAsn: 1.435 ± 0.408
0.359TrpPro: 0.359 ± 0.173
0.538TrpGln: 0.538 ± 0.221
0.807TrpArg: 0.807 ± 0.284
0.987TrpSer: 0.987 ± 0.337
1.346TrpThr: 1.346 ± 0.305
1.525TrpVal: 1.525 ± 0.491
0.538TrpTrp: 0.538 ± 0.236
0.09TrpTyr: 0.09 ± 0.096
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.05TyrAla: 3.05 ± 0.478
0.269TyrCys: 0.269 ± 0.158
2.691TyrAsp: 2.691 ± 0.675
2.512TyrGlu: 2.512 ± 0.568
0.718TyrPhe: 0.718 ± 0.309
2.512TyrGly: 2.512 ± 0.447
0.449TyrHis: 0.449 ± 0.237
1.615TyrIle: 1.615 ± 0.451
1.884TyrLys: 1.884 ± 0.43
2.87TyrLeu: 2.87 ± 0.548
0.897TyrMet: 0.897 ± 0.272
1.704TyrAsn: 1.704 ± 0.3
0.987TyrPro: 0.987 ± 0.34
1.435TyrGln: 1.435 ± 0.447
1.884TyrArg: 1.884 ± 0.514
2.153TyrSer: 2.153 ± 0.395
2.243TyrThr: 2.243 ± 0.422
2.153TyrVal: 2.153 ± 0.626
0.538TyrTrp: 0.538 ± 0.27
0.449TyrTyr: 0.449 ± 0.276
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 38 proteins (11149 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski