Amino acid dipepetide frequency for Klebsiella phage KPN3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.758AlaAla: 7.758 ± 1.289
0.541AlaCys: 0.541 ± 0.263
5.773AlaAsp: 5.773 ± 0.959
5.232AlaGlu: 5.232 ± 0.864
3.067AlaPhe: 3.067 ± 0.556
6.495AlaGly: 6.495 ± 1.18
1.173AlaHis: 1.173 ± 0.279
3.789AlaIle: 3.789 ± 0.687
6.495AlaLys: 6.495 ± 0.862
8.209AlaLeu: 8.209 ± 1.065
3.247AlaMet: 3.247 ± 0.666
4.42AlaAsn: 4.42 ± 0.572
3.067AlaPro: 3.067 ± 0.552
4.33AlaGln: 4.33 ± 0.647
4.51AlaArg: 4.51 ± 0.685
5.773AlaSer: 5.773 ± 0.746
4.149AlaThr: 4.149 ± 0.552
5.412AlaVal: 5.412 ± 0.669
1.082AlaTrp: 1.082 ± 0.259
2.526AlaTyr: 2.526 ± 0.518
0.0AlaXaa: 0.0 ± 0.0
Cys
0.722CysAla: 0.722 ± 0.322
0.09CysCys: 0.09 ± 0.095
0.631CysAsp: 0.631 ± 0.331
0.451CysGlu: 0.451 ± 0.257
0.541CysPhe: 0.541 ± 0.278
0.631CysGly: 0.631 ± 0.247
0.18CysHis: 0.18 ± 0.138
0.631CysIle: 0.631 ± 0.239
0.361CysLys: 0.361 ± 0.174
0.812CysLeu: 0.812 ± 0.338
0.0CysMet: 0.0 ± 0.0
0.18CysAsn: 0.18 ± 0.143
0.541CysPro: 0.541 ± 0.222
0.812CysGln: 0.812 ± 0.351
0.541CysArg: 0.541 ± 0.287
0.541CysSer: 0.541 ± 0.25
0.451CysThr: 0.451 ± 0.251
0.902CysVal: 0.902 ± 0.283
0.18CysTrp: 0.18 ± 0.131
0.361CysTyr: 0.361 ± 0.2
0.0CysXaa: 0.0 ± 0.0
Asp
5.502AspAla: 5.502 ± 0.588
0.451AspCys: 0.451 ± 0.239
3.789AspAsp: 3.789 ± 0.547
3.698AspGlu: 3.698 ± 0.633
2.526AspPhe: 2.526 ± 0.474
5.773AspGly: 5.773 ± 0.551
0.992AspHis: 0.992 ± 0.249
2.706AspIle: 2.706 ± 0.402
4.6AspLys: 4.6 ± 0.858
3.789AspLeu: 3.789 ± 0.533
2.075AspMet: 2.075 ± 0.456
2.165AspAsn: 2.165 ± 0.343
2.796AspPro: 2.796 ± 0.446
2.616AspGln: 2.616 ± 0.487
3.067AspArg: 3.067 ± 0.459
3.608AspSer: 3.608 ± 0.419
4.059AspThr: 4.059 ± 0.558
4.24AspVal: 4.24 ± 0.485
1.082AspTrp: 1.082 ± 0.34
1.804AspTyr: 1.804 ± 0.349
0.0AspXaa: 0.0 ± 0.0
Glu
7.036GluAla: 7.036 ± 1.064
0.812GluCys: 0.812 ± 0.328
4.42GluAsp: 4.42 ± 0.719
5.502GluGlu: 5.502 ± 0.858
2.616GluPhe: 2.616 ± 0.473
5.593GluGly: 5.593 ± 0.818
1.173GluHis: 1.173 ± 0.358
2.796GluIle: 2.796 ± 0.454
2.526GluLys: 2.526 ± 0.621
5.683GluLeu: 5.683 ± 0.648
1.533GluMet: 1.533 ± 0.436
1.984GluAsn: 1.984 ± 0.355
2.436GluPro: 2.436 ± 0.701
3.608GluGln: 3.608 ± 0.868
4.24GluArg: 4.24 ± 0.781
3.969GluSer: 3.969 ± 0.554
3.067GluThr: 3.067 ± 0.555
5.142GluVal: 5.142 ± 0.685
0.361GluTrp: 0.361 ± 0.153
2.706GluTyr: 2.706 ± 0.384
0.0GluXaa: 0.0 ± 0.0
Phe
2.616PheAla: 2.616 ± 0.529
0.361PheCys: 0.361 ± 0.189
2.977PheAsp: 2.977 ± 0.468
2.075PheGlu: 2.075 ± 0.386
0.812PhePhe: 0.812 ± 0.222
2.887PheGly: 2.887 ± 0.627
0.361PheHis: 0.361 ± 0.146
1.984PheIle: 1.984 ± 0.537
2.526PheLys: 2.526 ± 0.431
2.887PheLeu: 2.887 ± 0.716
0.902PheMet: 0.902 ± 0.249
1.894PheAsn: 1.894 ± 0.36
1.353PhePro: 1.353 ± 0.385
1.353PheGln: 1.353 ± 0.343
2.075PheArg: 2.075 ± 0.435
2.616PheSer: 2.616 ± 0.576
3.157PheThr: 3.157 ± 0.815
2.977PheVal: 2.977 ± 0.589
0.271PheTrp: 0.271 ± 0.145
1.173PheTyr: 1.173 ± 0.273
0.0PheXaa: 0.0 ± 0.0
Gly
6.765GlyAla: 6.765 ± 1.074
0.902GlyCys: 0.902 ± 0.333
5.322GlyAsp: 5.322 ± 0.541
5.232GlyGlu: 5.232 ± 0.688
3.518GlyPhe: 3.518 ± 0.763
5.683GlyGly: 5.683 ± 0.832
1.353GlyHis: 1.353 ± 0.355
5.322GlyIle: 5.322 ± 0.896
5.502GlyLys: 5.502 ± 0.848
6.855GlyLeu: 6.855 ± 0.946
1.984GlyMet: 1.984 ± 0.343
3.067GlyAsn: 3.067 ± 0.53
1.082GlyPro: 1.082 ± 0.398
2.436GlyGln: 2.436 ± 0.464
4.059GlyArg: 4.059 ± 0.435
5.773GlySer: 5.773 ± 0.861
4.42GlyThr: 4.42 ± 0.733
5.051GlyVal: 5.051 ± 0.862
1.443GlyTrp: 1.443 ± 0.418
3.247GlyTyr: 3.247 ± 0.475
0.0GlyXaa: 0.0 ± 0.0
His
1.082HisAla: 1.082 ± 0.284
0.451HisCys: 0.451 ± 0.189
0.992HisAsp: 0.992 ± 0.233
1.082HisGlu: 1.082 ± 0.362
0.722HisPhe: 0.722 ± 0.207
1.533HisGly: 1.533 ± 0.492
0.541HisHis: 0.541 ± 0.25
0.902HisIle: 0.902 ± 0.327
1.173HisLys: 1.173 ± 0.289
1.263HisLeu: 1.263 ± 0.387
0.992HisMet: 0.992 ± 0.378
0.361HisAsn: 0.361 ± 0.19
0.722HisPro: 0.722 ± 0.229
0.361HisGln: 0.361 ± 0.158
0.361HisArg: 0.361 ± 0.188
1.082HisSer: 1.082 ± 0.301
1.082HisThr: 1.082 ± 0.313
1.714HisVal: 1.714 ± 0.342
0.18HisTrp: 0.18 ± 0.106
0.812HisTyr: 0.812 ± 0.185
0.0HisXaa: 0.0 ± 0.0
Ile
4.691IleAla: 4.691 ± 0.565
0.541IleCys: 0.541 ± 0.216
3.247IleAsp: 3.247 ± 0.557
2.887IleGlu: 2.887 ± 0.547
1.082IlePhe: 1.082 ± 0.266
4.059IleGly: 4.059 ± 0.582
0.722IleHis: 0.722 ± 0.269
2.616IleIle: 2.616 ± 0.57
3.157IleLys: 3.157 ± 0.548
3.879IleLeu: 3.879 ± 0.508
1.082IleMet: 1.082 ± 0.338
2.255IleAsn: 2.255 ± 0.539
2.706IlePro: 2.706 ± 0.417
1.353IleGln: 1.353 ± 0.365
3.789IleArg: 3.789 ± 0.565
3.698IleSer: 3.698 ± 0.485
2.616IleThr: 2.616 ± 0.557
2.887IleVal: 2.887 ± 0.519
0.451IleTrp: 0.451 ± 0.201
1.533IleTyr: 1.533 ± 0.411
0.0IleXaa: 0.0 ± 0.0
Lys
7.938LysAla: 7.938 ± 1.133
0.541LysCys: 0.541 ± 0.264
3.789LysAsp: 3.789 ± 0.746
5.051LysGlu: 5.051 ± 0.707
2.436LysPhe: 2.436 ± 0.514
6.314LysGly: 6.314 ± 1.026
1.804LysHis: 1.804 ± 0.427
2.436LysIle: 2.436 ± 0.467
3.338LysLys: 3.338 ± 0.833
5.232LysLeu: 5.232 ± 0.735
1.804LysMet: 1.804 ± 0.443
2.887LysAsn: 2.887 ± 0.422
2.436LysPro: 2.436 ± 0.723
2.345LysGln: 2.345 ± 0.44
4.059LysArg: 4.059 ± 0.72
3.518LysSer: 3.518 ± 0.676
2.977LysThr: 2.977 ± 0.464
5.502LysVal: 5.502 ± 0.715
0.722LysTrp: 0.722 ± 0.327
1.263LysTyr: 1.263 ± 0.312
0.0LysXaa: 0.0 ± 0.0
Leu
7.397LeuAla: 7.397 ± 1.155
0.361LeuCys: 0.361 ± 0.198
4.149LeuAsp: 4.149 ± 0.495
6.224LeuGlu: 6.224 ± 1.085
2.796LeuPhe: 2.796 ± 0.53
5.683LeuGly: 5.683 ± 0.739
1.082LeuHis: 1.082 ± 0.354
4.149LeuIle: 4.149 ± 0.74
7.126LeuLys: 7.126 ± 0.777
5.593LeuLeu: 5.593 ± 0.735
2.977LeuMet: 2.977 ± 0.452
4.059LeuAsn: 4.059 ± 0.638
3.157LeuPro: 3.157 ± 0.518
3.428LeuGln: 3.428 ± 0.515
5.051LeuArg: 5.051 ± 0.683
5.142LeuSer: 5.142 ± 0.781
4.871LeuThr: 4.871 ± 0.708
4.871LeuVal: 4.871 ± 0.685
1.443LeuTrp: 1.443 ± 0.521
2.616LeuTyr: 2.616 ± 0.569
0.0LeuXaa: 0.0 ± 0.0
Met
3.338MetAla: 3.338 ± 0.439
0.361MetCys: 0.361 ± 0.168
2.165MetAsp: 2.165 ± 0.404
1.353MetGlu: 1.353 ± 0.357
0.902MetPhe: 0.902 ± 0.252
2.345MetGly: 2.345 ± 0.566
0.451MetHis: 0.451 ± 0.207
1.173MetIle: 1.173 ± 0.3
1.533MetLys: 1.533 ± 0.418
2.887MetLeu: 2.887 ± 0.472
0.451MetMet: 0.451 ± 0.212
1.082MetAsn: 1.082 ± 0.256
0.722MetPro: 0.722 ± 0.246
2.075MetGln: 2.075 ± 0.48
1.173MetArg: 1.173 ± 0.319
1.533MetSer: 1.533 ± 0.391
1.894MetThr: 1.894 ± 0.379
2.075MetVal: 2.075 ± 0.585
0.09MetTrp: 0.09 ± 0.096
0.812MetTyr: 0.812 ± 0.185
0.0MetXaa: 0.0 ± 0.0
Asn
3.698AsnAla: 3.698 ± 0.708
0.451AsnCys: 0.451 ± 0.209
2.526AsnAsp: 2.526 ± 0.516
2.345AsnGlu: 2.345 ± 0.521
1.533AsnPhe: 1.533 ± 0.314
4.24AsnGly: 4.24 ± 0.772
0.451AsnHis: 0.451 ± 0.153
3.157AsnIle: 3.157 ± 0.503
2.345AsnLys: 2.345 ± 0.401
3.157AsnLeu: 3.157 ± 0.66
0.631AsnMet: 0.631 ± 0.261
1.533AsnAsn: 1.533 ± 0.4
2.616AsnPro: 2.616 ± 0.38
1.443AsnGln: 1.443 ± 0.331
1.263AsnArg: 1.263 ± 0.456
2.887AsnSer: 2.887 ± 0.575
1.714AsnThr: 1.714 ± 0.344
3.067AsnVal: 3.067 ± 0.521
0.722AsnTrp: 0.722 ± 0.293
1.894AsnTyr: 1.894 ± 0.427
0.0AsnXaa: 0.0 ± 0.0
Pro
2.977ProAla: 2.977 ± 0.614
0.361ProCys: 0.361 ± 0.19
2.165ProAsp: 2.165 ± 0.4
4.33ProGlu: 4.33 ± 0.723
1.533ProPhe: 1.533 ± 0.319
2.526ProGly: 2.526 ± 0.471
0.271ProHis: 0.271 ± 0.135
1.173ProIle: 1.173 ± 0.38
2.255ProLys: 2.255 ± 0.451
2.616ProLeu: 2.616 ± 0.459
0.631ProMet: 0.631 ± 0.287
1.894ProAsn: 1.894 ± 0.488
0.902ProPro: 0.902 ± 0.376
1.353ProGln: 1.353 ± 0.347
2.436ProArg: 2.436 ± 0.433
2.345ProSer: 2.345 ± 0.373
1.804ProThr: 1.804 ± 0.444
2.977ProVal: 2.977 ± 0.431
0.631ProTrp: 0.631 ± 0.21
1.353ProTyr: 1.353 ± 0.431
0.0ProXaa: 0.0 ± 0.0
Gln
3.789GlnAla: 3.789 ± 0.714
0.18GlnCys: 0.18 ± 0.141
2.526GlnAsp: 2.526 ± 0.276
3.157GlnGlu: 3.157 ± 0.575
1.804GlnPhe: 1.804 ± 0.312
2.796GlnGly: 2.796 ± 0.475
0.451GlnHis: 0.451 ± 0.205
1.714GlnIle: 1.714 ± 0.453
2.977GlnLys: 2.977 ± 0.575
4.51GlnLeu: 4.51 ± 0.471
1.714GlnMet: 1.714 ± 0.497
1.353GlnAsn: 1.353 ± 0.256
1.714GlnPro: 1.714 ± 0.265
3.608GlnGln: 3.608 ± 0.666
2.616GlnArg: 2.616 ± 0.58
2.887GlnSer: 2.887 ± 0.528
1.624GlnThr: 1.624 ± 0.493
3.067GlnVal: 3.067 ± 0.454
0.722GlnTrp: 0.722 ± 0.232
1.714GlnTyr: 1.714 ± 0.459
0.0GlnXaa: 0.0 ± 0.0
Arg
4.871ArgAla: 4.871 ± 0.869
0.631ArgCys: 0.631 ± 0.247
3.518ArgAsp: 3.518 ± 0.443
3.608ArgGlu: 3.608 ± 0.57
1.984ArgPhe: 1.984 ± 0.456
3.789ArgGly: 3.789 ± 0.662
0.992ArgHis: 0.992 ± 0.325
2.616ArgIle: 2.616 ± 0.49
4.51ArgLys: 4.51 ± 0.684
4.871ArgLeu: 4.871 ± 0.857
1.443ArgMet: 1.443 ± 0.311
3.067ArgAsn: 3.067 ± 0.671
2.165ArgPro: 2.165 ± 0.407
2.977ArgGln: 2.977 ± 0.49
2.706ArgArg: 2.706 ± 0.39
3.789ArgSer: 3.789 ± 0.617
2.526ArgThr: 2.526 ± 0.487
3.789ArgVal: 3.789 ± 0.871
0.812ArgTrp: 0.812 ± 0.307
1.173ArgTyr: 1.173 ± 0.247
0.0ArgXaa: 0.0 ± 0.0
Ser
4.24SerAla: 4.24 ± 0.695
0.902SerCys: 0.902 ± 0.45
3.698SerAsp: 3.698 ± 0.527
3.789SerGlu: 3.789 ± 0.488
3.608SerPhe: 3.608 ± 0.642
5.322SerGly: 5.322 ± 0.88
2.165SerHis: 2.165 ± 0.49
3.247SerIle: 3.247 ± 0.529
4.24SerLys: 4.24 ± 0.554
5.142SerLeu: 5.142 ± 0.87
1.984SerMet: 1.984 ± 0.506
1.984SerAsn: 1.984 ± 0.636
1.533SerPro: 1.533 ± 0.382
3.698SerGln: 3.698 ± 0.575
3.789SerArg: 3.789 ± 0.803
3.608SerSer: 3.608 ± 0.989
4.059SerThr: 4.059 ± 0.943
4.059SerVal: 4.059 ± 0.49
0.812SerTrp: 0.812 ± 0.227
2.345SerTyr: 2.345 ± 0.543
0.0SerXaa: 0.0 ± 0.0
Thr
4.691ThrAla: 4.691 ± 0.793
0.722ThrCys: 0.722 ± 0.251
3.067ThrAsp: 3.067 ± 0.458
3.247ThrGlu: 3.247 ± 0.594
1.984ThrPhe: 1.984 ± 0.442
4.871ThrGly: 4.871 ± 0.827
0.992ThrHis: 0.992 ± 0.232
3.879ThrIle: 3.879 ± 0.537
4.149ThrLys: 4.149 ± 0.67
5.232ThrLeu: 5.232 ± 0.755
1.443ThrMet: 1.443 ± 0.308
1.894ThrAsn: 1.894 ± 0.453
2.616ThrPro: 2.616 ± 0.453
2.255ThrGln: 2.255 ± 0.538
2.616ThrArg: 2.616 ± 0.51
3.338ThrSer: 3.338 ± 0.421
3.428ThrThr: 3.428 ± 0.893
3.428ThrVal: 3.428 ± 0.517
0.541ThrTrp: 0.541 ± 0.232
1.624ThrTyr: 1.624 ± 0.383
0.0ThrXaa: 0.0 ± 0.0
Val
5.232ValAla: 5.232 ± 0.757
0.451ValCys: 0.451 ± 0.218
3.247ValAsp: 3.247 ± 0.471
4.24ValGlu: 4.24 ± 0.574
2.255ValPhe: 2.255 ± 0.552
5.142ValGly: 5.142 ± 0.511
1.353ValHis: 1.353 ± 0.378
3.428ValIle: 3.428 ± 0.632
4.781ValLys: 4.781 ± 0.573
6.404ValLeu: 6.404 ± 0.88
1.533ValMet: 1.533 ± 0.369
3.428ValAsn: 3.428 ± 0.678
2.345ValPro: 2.345 ± 0.525
2.526ValGln: 2.526 ± 0.414
4.33ValArg: 4.33 ± 0.627
5.322ValSer: 5.322 ± 0.992
4.961ValThr: 4.961 ± 0.812
5.322ValVal: 5.322 ± 0.854
0.722ValTrp: 0.722 ± 0.304
2.706ValTyr: 2.706 ± 0.596
0.0ValXaa: 0.0 ± 0.0
Trp
0.451TrpAla: 0.451 ± 0.206
0.361TrpCys: 0.361 ± 0.197
0.631TrpAsp: 0.631 ± 0.204
1.263TrpGlu: 1.263 ± 0.291
0.361TrpPhe: 0.361 ± 0.172
0.541TrpGly: 0.541 ± 0.22
0.451TrpHis: 0.451 ± 0.256
0.361TrpIle: 0.361 ± 0.178
1.263TrpLys: 1.263 ± 0.354
1.173TrpLeu: 1.173 ± 0.39
0.451TrpMet: 0.451 ± 0.205
0.541TrpAsn: 0.541 ± 0.209
0.271TrpPro: 0.271 ± 0.138
0.722TrpGln: 0.722 ± 0.224
0.992TrpArg: 0.992 ± 0.23
1.263TrpSer: 1.263 ± 0.384
0.812TrpThr: 0.812 ± 0.245
1.082TrpVal: 1.082 ± 0.29
0.271TrpTrp: 0.271 ± 0.154
0.09TrpTyr: 0.09 ± 0.085
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.255TyrAla: 2.255 ± 0.482
0.09TyrCys: 0.09 ± 0.1
2.706TyrAsp: 2.706 ± 0.59
2.436TyrGlu: 2.436 ± 0.483
1.173TyrPhe: 1.173 ± 0.283
2.796TyrGly: 2.796 ± 0.475
0.451TyrHis: 0.451 ± 0.219
1.173TyrIle: 1.173 ± 0.305
1.533TyrLys: 1.533 ± 0.331
1.984TyrLeu: 1.984 ± 0.404
1.443TyrMet: 1.443 ± 0.452
1.714TyrAsn: 1.714 ± 0.504
1.353TyrPro: 1.353 ± 0.417
1.624TyrGln: 1.624 ± 0.438
2.165TyrArg: 2.165 ± 0.411
1.533TyrSer: 1.533 ± 0.516
2.345TyrThr: 2.345 ± 0.649
2.165TyrVal: 2.165 ± 0.449
0.812TyrTrp: 0.812 ± 0.242
0.902TyrTyr: 0.902 ± 0.339
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 45 proteins (11087 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski