Amino acid dipepetide frequency for Klebsiella phage vB_KpnS_Domnhall

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.926AlaAla: 8.926 ± 1.018
0.965AlaCys: 0.965 ± 0.255
5.308AlaAsp: 5.308 ± 0.473
6.031AlaGlu: 6.031 ± 0.745
3.136AlaPhe: 3.136 ± 0.496
6.152AlaGly: 6.152 ± 0.519
1.508AlaHis: 1.508 ± 0.338
5.549AlaIle: 5.549 ± 0.461
6.514AlaLys: 6.514 ± 0.8
6.514AlaLeu: 6.514 ± 0.691
3.136AlaMet: 3.136 ± 0.384
3.619AlaAsn: 3.619 ± 0.543
2.774AlaPro: 2.774 ± 0.387
3.559AlaGln: 3.559 ± 0.669
4.343AlaArg: 4.343 ± 0.495
5.971AlaSer: 5.971 ± 0.688
4.463AlaThr: 4.463 ± 0.634
5.79AlaVal: 5.79 ± 0.767
1.206AlaTrp: 1.206 ± 0.242
2.352AlaTyr: 2.352 ± 0.402
0.0AlaXaa: 0.0 ± 0.0
Cys
0.844CysAla: 0.844 ± 0.229
0.181CysCys: 0.181 ± 0.1
1.689CysAsp: 1.689 ± 0.343
0.965CysGlu: 0.965 ± 0.257
0.422CysPhe: 0.422 ± 0.16
1.267CysGly: 1.267 ± 0.267
0.241CysHis: 0.241 ± 0.128
1.267CysIle: 1.267 ± 0.293
0.603CysLys: 0.603 ± 0.222
0.603CysLeu: 0.603 ± 0.192
0.302CysMet: 0.302 ± 0.15
0.362CysAsn: 0.362 ± 0.137
0.483CysPro: 0.483 ± 0.164
0.362CysGln: 0.362 ± 0.176
1.146CysArg: 1.146 ± 0.284
0.603CysSer: 0.603 ± 0.195
0.905CysThr: 0.905 ± 0.231
1.086CysVal: 1.086 ± 0.254
0.181CysTrp: 0.181 ± 0.096
0.422CysTyr: 0.422 ± 0.152
0.0CysXaa: 0.0 ± 0.0
Asp
6.755AspAla: 6.755 ± 0.636
0.543AspCys: 0.543 ± 0.182
3.92AspAsp: 3.92 ± 0.577
4.524AspGlu: 4.524 ± 0.511
2.835AspPhe: 2.835 ± 0.436
6.393AspGly: 6.393 ± 0.887
1.025AspHis: 1.025 ± 0.226
4.162AspIle: 4.162 ± 0.507
3.92AspLys: 3.92 ± 0.55
4.343AspLeu: 4.343 ± 0.559
1.448AspMet: 1.448 ± 0.341
2.533AspAsn: 2.533 ± 0.47
2.292AspPro: 2.292 ± 0.322
1.508AspGln: 1.508 ± 0.325
2.835AspArg: 2.835 ± 0.489
4.403AspSer: 4.403 ± 0.488
2.895AspThr: 2.895 ± 0.369
3.981AspVal: 3.981 ± 0.478
0.965AspTrp: 0.965 ± 0.293
2.654AspTyr: 2.654 ± 0.418
0.0AspXaa: 0.0 ± 0.0
Glu
4.885GluAla: 4.885 ± 0.473
1.206GluCys: 1.206 ± 0.241
3.498GluAsp: 3.498 ± 0.603
3.8GluGlu: 3.8 ± 0.561
4.162GluPhe: 4.162 ± 0.538
3.739GluGly: 3.739 ± 0.473
0.784GluHis: 0.784 ± 0.195
3.498GluIle: 3.498 ± 0.486
3.8GluLys: 3.8 ± 0.503
4.101GluLeu: 4.101 ± 0.602
2.593GluMet: 2.593 ± 0.365
2.533GluAsn: 2.533 ± 0.383
2.171GluPro: 2.171 ± 0.372
3.257GluGln: 3.257 ± 0.563
3.317GluArg: 3.317 ± 0.585
5.428GluSer: 5.428 ± 0.555
3.378GluThr: 3.378 ± 0.58
3.378GluVal: 3.378 ± 0.616
1.327GluTrp: 1.327 ± 0.31
3.076GluTyr: 3.076 ± 0.492
0.0GluXaa: 0.0 ± 0.0
Phe
3.498PheAla: 3.498 ± 0.487
0.905PheCys: 0.905 ± 0.271
3.076PheAsp: 3.076 ± 0.509
2.352PheGlu: 2.352 ± 0.365
1.448PhePhe: 1.448 ± 0.308
4.162PheGly: 4.162 ± 0.543
1.086PheHis: 1.086 ± 0.277
3.136PheIle: 3.136 ± 0.506
2.654PheLys: 2.654 ± 0.438
2.292PheLeu: 2.292 ± 0.341
0.784PheMet: 0.784 ± 0.24
2.473PheAsn: 2.473 ± 0.42
0.965PhePro: 0.965 ± 0.22
1.267PheGln: 1.267 ± 0.306
2.413PheArg: 2.413 ± 0.417
2.533PheSer: 2.533 ± 0.393
3.136PheThr: 3.136 ± 0.434
2.051PheVal: 2.051 ± 0.362
0.905PheTrp: 0.905 ± 0.223
1.267PheTyr: 1.267 ± 0.275
0.0PheXaa: 0.0 ± 0.0
Gly
5.609GlyAla: 5.609 ± 0.735
1.086GlyCys: 1.086 ± 0.282
4.704GlyAsp: 4.704 ± 0.45
5.428GlyGlu: 5.428 ± 0.571
3.076GlyPhe: 3.076 ± 0.377
7.961GlyGly: 7.961 ± 0.955
0.965GlyHis: 0.965 ± 0.292
4.162GlyIle: 4.162 ± 0.416
5.669GlyLys: 5.669 ± 0.624
5.247GlyLeu: 5.247 ± 0.617
2.774GlyMet: 2.774 ± 0.378
4.282GlyAsn: 4.282 ± 0.417
1.267GlyPro: 1.267 ± 0.229
2.051GlyGln: 2.051 ± 0.357
4.222GlyArg: 4.222 ± 0.378
5.187GlySer: 5.187 ± 0.824
3.559GlyThr: 3.559 ± 0.533
6.695GlyVal: 6.695 ± 0.598
1.025GlyTrp: 1.025 ± 0.265
2.714GlyTyr: 2.714 ± 0.384
0.0GlyXaa: 0.0 ± 0.0
His
1.267HisAla: 1.267 ± 0.355
0.362HisCys: 0.362 ± 0.13
1.448HisAsp: 1.448 ± 0.335
1.568HisGlu: 1.568 ± 0.369
0.905HisPhe: 0.905 ± 0.226
1.628HisGly: 1.628 ± 0.328
0.784HisHis: 0.784 ± 0.276
1.387HisIle: 1.387 ± 0.365
1.086HisLys: 1.086 ± 0.366
1.206HisLeu: 1.206 ± 0.346
0.121HisMet: 0.121 ± 0.083
0.844HisAsn: 0.844 ± 0.23
0.543HisPro: 0.543 ± 0.194
0.422HisGln: 0.422 ± 0.139
1.146HisArg: 1.146 ± 0.322
1.327HisSer: 1.327 ± 0.295
0.965HisThr: 0.965 ± 0.177
1.267HisVal: 1.267 ± 0.352
0.362HisTrp: 0.362 ± 0.133
0.844HisTyr: 0.844 ± 0.223
0.0HisXaa: 0.0 ± 0.0
Ile
6.273IleAla: 6.273 ± 0.74
1.146IleCys: 1.146 ± 0.246
4.162IleAsp: 4.162 ± 0.534
4.101IleGlu: 4.101 ± 0.528
2.051IlePhe: 2.051 ± 0.332
3.86IleGly: 3.86 ± 0.446
1.025IleHis: 1.025 ± 0.227
2.955IleIle: 2.955 ± 0.497
3.86IleLys: 3.86 ± 0.504
3.076IleLeu: 3.076 ± 0.5
2.171IleMet: 2.171 ± 0.426
3.136IleAsn: 3.136 ± 0.446
2.593IlePro: 2.593 ± 0.385
2.111IleGln: 2.111 ± 0.34
2.774IleArg: 2.774 ± 0.457
3.257IleSer: 3.257 ± 0.455
4.704IleThr: 4.704 ± 0.516
4.524IleVal: 4.524 ± 0.518
0.905IleTrp: 0.905 ± 0.274
1.809IleTyr: 1.809 ± 0.333
0.0IleXaa: 0.0 ± 0.0
Lys
5.911LysAla: 5.911 ± 0.637
0.663LysCys: 0.663 ± 0.218
3.8LysAsp: 3.8 ± 0.502
4.584LysGlu: 4.584 ± 0.43
2.413LysPhe: 2.413 ± 0.34
3.679LysGly: 3.679 ± 0.501
1.267LysHis: 1.267 ± 0.306
4.101LysIle: 4.101 ± 0.493
3.257LysLys: 3.257 ± 0.489
4.463LysLeu: 4.463 ± 0.463
2.714LysMet: 2.714 ± 0.55
2.533LysAsn: 2.533 ± 0.348
2.593LysPro: 2.593 ± 0.522
2.533LysGln: 2.533 ± 0.515
3.981LysArg: 3.981 ± 0.619
3.559LysSer: 3.559 ± 0.593
4.041LysThr: 4.041 ± 0.624
3.92LysVal: 3.92 ± 0.568
1.387LysTrp: 1.387 ± 0.288
1.749LysTyr: 1.749 ± 0.328
0.0LysXaa: 0.0 ± 0.0
Leu
6.212LeuAla: 6.212 ± 0.621
0.844LeuCys: 0.844 ± 0.211
3.86LeuAsp: 3.86 ± 0.513
3.559LeuGlu: 3.559 ± 0.553
2.232LeuPhe: 2.232 ± 0.353
4.825LeuGly: 4.825 ± 0.517
1.689LeuHis: 1.689 ± 0.444
3.438LeuIle: 3.438 ± 0.481
4.825LeuLys: 4.825 ± 0.608
4.524LeuLeu: 4.524 ± 0.494
1.99LeuMet: 1.99 ± 0.413
2.955LeuAsn: 2.955 ± 0.461
2.955LeuPro: 2.955 ± 0.411
2.413LeuGln: 2.413 ± 0.402
3.136LeuArg: 3.136 ± 0.45
4.403LeuSer: 4.403 ± 0.558
4.584LeuThr: 4.584 ± 0.624
4.403LeuVal: 4.403 ± 0.424
0.543LeuTrp: 0.543 ± 0.148
2.171LeuTyr: 2.171 ± 0.323
0.0LeuXaa: 0.0 ± 0.0
Met
3.679MetAla: 3.679 ± 0.556
0.302MetCys: 0.302 ± 0.136
1.327MetAsp: 1.327 ± 0.305
1.628MetGlu: 1.628 ± 0.302
1.689MetPhe: 1.689 ± 0.322
1.628MetGly: 1.628 ± 0.462
1.086MetHis: 1.086 ± 0.288
2.352MetIle: 2.352 ± 0.451
2.413MetLys: 2.413 ± 0.341
2.051MetLeu: 2.051 ± 0.324
1.146MetMet: 1.146 ± 0.285
1.689MetAsn: 1.689 ± 0.281
1.146MetPro: 1.146 ± 0.203
1.689MetGln: 1.689 ± 0.36
2.232MetArg: 2.232 ± 0.342
1.568MetSer: 1.568 ± 0.279
1.628MetThr: 1.628 ± 0.31
1.628MetVal: 1.628 ± 0.283
0.422MetTrp: 0.422 ± 0.167
0.965MetTyr: 0.965 ± 0.228
0.0MetXaa: 0.0 ± 0.0
Asn
3.86AsnAla: 3.86 ± 0.755
0.302AsnCys: 0.302 ± 0.128
2.533AsnAsp: 2.533 ± 0.37
3.076AsnGlu: 3.076 ± 0.44
1.568AsnPhe: 1.568 ± 0.29
6.212AsnGly: 6.212 ± 0.63
0.965AsnHis: 0.965 ± 0.316
2.352AsnIle: 2.352 ± 0.452
2.895AsnLys: 2.895 ± 0.453
2.473AsnLeu: 2.473 ± 0.5
1.267AsnMet: 1.267 ± 0.333
2.533AsnAsn: 2.533 ± 0.427
1.568AsnPro: 1.568 ± 0.253
1.809AsnGln: 1.809 ± 0.351
2.654AsnArg: 2.654 ± 0.321
2.232AsnSer: 2.232 ± 0.374
2.111AsnThr: 2.111 ± 0.384
3.378AsnVal: 3.378 ± 0.447
0.663AsnTrp: 0.663 ± 0.193
1.387AsnTyr: 1.387 ± 0.233
0.0AsnXaa: 0.0 ± 0.0
Pro
1.87ProAla: 1.87 ± 0.433
0.302ProCys: 0.302 ± 0.128
3.197ProAsp: 3.197 ± 0.484
3.257ProGlu: 3.257 ± 0.509
1.508ProPhe: 1.508 ± 0.251
2.171ProGly: 2.171 ± 0.35
0.543ProHis: 0.543 ± 0.188
1.99ProIle: 1.99 ± 0.434
2.413ProLys: 2.413 ± 0.424
1.749ProLeu: 1.749 ± 0.338
0.663ProMet: 0.663 ± 0.213
1.327ProAsn: 1.327 ± 0.299
0.422ProPro: 0.422 ± 0.152
1.508ProGln: 1.508 ± 0.379
1.93ProArg: 1.93 ± 0.335
2.051ProSer: 2.051 ± 0.367
1.809ProThr: 1.809 ± 0.338
2.774ProVal: 2.774 ± 0.396
0.543ProTrp: 0.543 ± 0.193
1.086ProTyr: 1.086 ± 0.338
0.0ProXaa: 0.0 ± 0.0
Gln
3.076GlnAla: 3.076 ± 0.445
0.543GlnCys: 0.543 ± 0.187
2.593GlnAsp: 2.593 ± 0.389
1.689GlnGlu: 1.689 ± 0.327
1.327GlnPhe: 1.327 ± 0.293
2.111GlnGly: 2.111 ± 0.329
0.603GlnHis: 0.603 ± 0.201
2.895GlnIle: 2.895 ± 0.431
1.93GlnLys: 1.93 ± 0.445
3.197GlnLeu: 3.197 ± 0.528
1.628GlnMet: 1.628 ± 0.411
1.749GlnAsn: 1.749 ± 0.362
1.086GlnPro: 1.086 ± 0.355
2.111GlnGln: 2.111 ± 0.825
1.809GlnArg: 1.809 ± 0.34
2.473GlnSer: 2.473 ± 0.473
2.292GlnThr: 2.292 ± 0.361
3.136GlnVal: 3.136 ± 0.52
0.543GlnTrp: 0.543 ± 0.201
1.568GlnTyr: 1.568 ± 0.367
0.0GlnXaa: 0.0 ± 0.0
Arg
4.584ArgAla: 4.584 ± 0.444
1.086ArgCys: 1.086 ± 0.354
3.559ArgAsp: 3.559 ± 0.477
3.92ArgGlu: 3.92 ± 0.648
2.714ArgPhe: 2.714 ± 0.454
3.739ArgGly: 3.739 ± 0.374
0.724ArgHis: 0.724 ± 0.217
2.533ArgIle: 2.533 ± 0.397
3.679ArgLys: 3.679 ± 0.513
3.92ArgLeu: 3.92 ± 0.525
1.87ArgMet: 1.87 ± 0.396
1.93ArgAsn: 1.93 ± 0.326
1.749ArgPro: 1.749 ± 0.336
1.93ArgGln: 1.93 ± 0.451
2.895ArgArg: 2.895 ± 0.513
2.593ArgSer: 2.593 ± 0.387
2.413ArgThr: 2.413 ± 0.407
4.403ArgVal: 4.403 ± 0.574
0.663ArgTrp: 0.663 ± 0.178
2.051ArgTyr: 2.051 ± 0.343
0.0ArgXaa: 0.0 ± 0.0
Ser
5.308SerAla: 5.308 ± 0.66
0.844SerCys: 0.844 ± 0.255
4.041SerAsp: 4.041 ± 0.614
4.162SerGlu: 4.162 ± 0.531
2.654SerPhe: 2.654 ± 0.346
5.971SerGly: 5.971 ± 0.588
1.508SerHis: 1.508 ± 0.357
3.257SerIle: 3.257 ± 0.489
3.739SerLys: 3.739 ± 0.507
3.86SerLeu: 3.86 ± 0.515
2.111SerMet: 2.111 ± 0.36
3.016SerAsn: 3.016 ± 0.535
2.111SerPro: 2.111 ± 0.435
2.593SerGln: 2.593 ± 0.42
3.136SerArg: 3.136 ± 0.427
2.714SerSer: 2.714 ± 0.484
3.679SerThr: 3.679 ± 0.584
3.8SerVal: 3.8 ± 0.399
1.327SerTrp: 1.327 ± 0.22
1.99SerTyr: 1.99 ± 0.306
0.0SerXaa: 0.0 ± 0.0
Thr
4.644ThrAla: 4.644 ± 0.619
0.965ThrCys: 0.965 ± 0.236
3.559ThrAsp: 3.559 ± 0.446
2.955ThrGlu: 2.955 ± 0.388
3.317ThrPhe: 3.317 ± 0.454
4.644ThrGly: 4.644 ± 0.549
1.206ThrHis: 1.206 ± 0.296
4.222ThrIle: 4.222 ± 0.541
3.136ThrLys: 3.136 ± 0.471
3.92ThrLeu: 3.92 ± 0.607
1.99ThrMet: 1.99 ± 0.317
2.352ThrAsn: 2.352 ± 0.349
2.051ThrPro: 2.051 ± 0.323
2.413ThrGln: 2.413 ± 0.596
2.593ThrArg: 2.593 ± 0.308
3.739ThrSer: 3.739 ± 0.6
3.016ThrThr: 3.016 ± 0.425
4.825ThrVal: 4.825 ± 0.473
0.422ThrTrp: 0.422 ± 0.15
2.051ThrTyr: 2.051 ± 0.307
0.0ThrXaa: 0.0 ± 0.0
Val
6.333ValAla: 6.333 ± 0.79
0.844ValCys: 0.844 ± 0.224
4.403ValAsp: 4.403 ± 0.554
4.101ValGlu: 4.101 ± 0.597
2.533ValPhe: 2.533 ± 0.394
4.343ValGly: 4.343 ± 0.676
1.206ValHis: 1.206 ± 0.332
4.946ValIle: 4.946 ± 0.503
3.679ValLys: 3.679 ± 0.542
4.343ValLeu: 4.343 ± 0.562
2.051ValMet: 2.051 ± 0.355
3.92ValAsn: 3.92 ± 0.432
2.593ValPro: 2.593 ± 0.386
2.292ValGln: 2.292 ± 0.432
3.257ValArg: 3.257 ± 0.409
5.066ValSer: 5.066 ± 0.63
5.127ValThr: 5.127 ± 0.606
5.609ValVal: 5.609 ± 0.754
1.267ValTrp: 1.267 ± 0.287
1.99ValTyr: 1.99 ± 0.349
0.0ValXaa: 0.0 ± 0.0
Trp
1.628TrpAla: 1.628 ± 0.341
0.483TrpCys: 0.483 ± 0.165
0.724TrpAsp: 0.724 ± 0.187
0.543TrpGlu: 0.543 ± 0.242
0.784TrpPhe: 0.784 ± 0.234
0.905TrpGly: 0.905 ± 0.245
0.603TrpHis: 0.603 ± 0.229
0.483TrpIle: 0.483 ± 0.164
0.965TrpLys: 0.965 ± 0.243
1.448TrpLeu: 1.448 ± 0.293
0.362TrpMet: 0.362 ± 0.161
0.362TrpAsn: 0.362 ± 0.146
0.422TrpPro: 0.422 ± 0.139
0.844TrpGln: 0.844 ± 0.178
1.387TrpArg: 1.387 ± 0.279
0.844TrpSer: 0.844 ± 0.299
0.905TrpThr: 0.905 ± 0.193
1.086TrpVal: 1.086 ± 0.228
0.362TrpTrp: 0.362 ± 0.145
0.603TrpTyr: 0.603 ± 0.183
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.654TyrAla: 2.654 ± 0.478
0.362TyrCys: 0.362 ± 0.161
2.533TyrAsp: 2.533 ± 0.324
1.689TyrGlu: 1.689 ± 0.301
1.628TyrPhe: 1.628 ± 0.32
2.292TyrGly: 2.292 ± 0.38
0.663TyrHis: 0.663 ± 0.224
1.809TyrIle: 1.809 ± 0.261
2.111TyrLys: 2.111 ± 0.443
2.292TyrLeu: 2.292 ± 0.347
1.086TyrMet: 1.086 ± 0.243
1.689TyrAsn: 1.689 ± 0.352
1.327TyrPro: 1.327 ± 0.29
1.689TyrGln: 1.689 ± 0.299
1.749TyrArg: 1.749 ± 0.333
1.87TyrSer: 1.87 ± 0.342
2.413TyrThr: 2.413 ± 0.349
2.171TyrVal: 2.171 ± 0.355
0.724TyrTrp: 0.724 ± 0.223
0.844TyrTyr: 0.844 ± 0.234
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 90 proteins (16581 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski