Amino acid dipeptide frequency for Klebsiella phage AmPh_EK52

In addition to single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
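As a rough illustration of what such a calculation could look like, the sketch below counts overlapping adjacent pairs in a list of protein sequences and reports them in per mille. It is not the database's own procedure: the function name, the toy sequences, the use of one-letter codes (the table uses three-letter codes), and the choice to normalize by the total number of pairs are all assumptions made for this example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping adjacent pairs are counted within each protein
    (never across protein boundaries) and normalized by the total pair count.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to 'X' (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy input, not sequences from this proteome.
freqs = dipeptide_per_mille(["MKAAL", "AAK"])
```

With the toy input, six pairs are counted in total and "AA" occurs twice, so it accounts for one third of all pairs.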

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
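The uniform expectation and the over/under-representation labels can be sketched as follows. The comparison against a fixed 2.5‰ threshold follows the reasoning in the text; whether the page's colouring uses exactly this rule, or takes the error estimates into account, is not stated, so treat the `classify` helper as a hypothetical illustration.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, expected):
    """Label a dipeptide relative to the uniform expectation."""
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


expected = expected_per_mille(20)           # 1000 / 400 = 2.5 per mille (0.25%)
expected_with_xaa = expected_per_mille(21)  # 1000 / 441, if Xaa is included

# Three values copied from the table (in per mille).
sample = {"AlaAla": 8.713, "CysCys": 0.081, "AspAsn": 2.443}
labels = {pair: classify(value, expected) for pair, value in sample.items()}
```

Here AlaAla comes out as over-represented, while CysCys and AspAsn fall below the 2.5‰ expectation.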

All values are given in per mille (‰), so they must be multiplied by 10⁻³ to obtain fractions.
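For readers who want the unit conversion spelled out, a minimal sketch (the helper names are made up for this example):

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value * 0.1


# AlaAla from the table: 8.713 per mille.
ala_ala_fraction = per_mille_to_fraction(8.713)
ala_ala_percent = per_mille_to_percent(8.713)
```

So the AlaAla entry of 8.713‰ corresponds to a fraction of about 0.008713, or roughly 0.87%.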

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.713 ± 0.836
AlaCys: 0.814 ± 0.235
AlaAsp: 6.759 ± 0.801
AlaGlu: 4.723 ± 0.801
AlaPhe: 3.257 ± 0.514
AlaGly: 7.329 ± 0.996
AlaHis: 1.384 ± 0.258
AlaIle: 4.072 ± 0.521
AlaLys: 6.433 ± 0.581
AlaLeu: 8.469 ± 0.848
AlaMet: 2.687 ± 0.548
AlaAsn: 4.235 ± 0.478
AlaPro: 2.85 ± 0.485
AlaGln: 3.664 ± 0.585
AlaArg: 5.293 ± 0.556
AlaSer: 5.13 ± 0.588
AlaThr: 3.909 ± 0.483
AlaVal: 5.537 ± 0.668
AlaTrp: 1.059 ± 0.347
AlaTyr: 3.094 ± 0.449
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.651 ± 0.265
CysCys: 0.081 ± 0.077
CysAsp: 0.651 ± 0.259
CysGlu: 0.651 ± 0.277
CysPhe: 0.407 ± 0.206
CysGly: 0.651 ± 0.18
CysHis: 0.326 ± 0.137
CysIle: 0.489 ± 0.189
CysLys: 0.407 ± 0.222
CysLeu: 0.814 ± 0.249
CysMet: 0.081 ± 0.085
CysAsn: 0.326 ± 0.166
CysPro: 0.57 ± 0.218
CysGln: 0.326 ± 0.155
CysArg: 0.651 ± 0.325
CysSer: 0.814 ± 0.248
CysThr: 0.407 ± 0.209
CysVal: 0.651 ± 0.208
CysTrp: 0.244 ± 0.156
CysTyr: 0.326 ± 0.188
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.537 ± 0.73
AspCys: 0.489 ± 0.217
AspAsp: 3.909 ± 0.517
AspGlu: 3.746 ± 0.567
AspPhe: 2.932 ± 0.438
AspGly: 6.352 ± 0.452
AspHis: 1.059 ± 0.252
AspIle: 2.687 ± 0.417
AspLys: 4.316 ± 0.522
AspLeu: 4.235 ± 0.684
AspMet: 2.28 ± 0.392
AspAsn: 2.443 ± 0.393
AspPro: 2.606 ± 0.445
AspGln: 2.362 ± 0.524
AspArg: 3.257 ± 0.465
AspSer: 3.502 ± 0.386
AspThr: 3.664 ± 0.434
AspVal: 4.072 ± 0.511
AspTrp: 0.977 ± 0.291
AspTyr: 2.443 ± 0.37
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.329 ± 1.017
GluCys: 0.896 ± 0.361
GluAsp: 4.316 ± 0.547
GluGlu: 5.782 ± 1.253
GluPhe: 2.769 ± 0.416
GluGly: 5.7 ± 0.79
GluHis: 1.221 ± 0.418
GluIle: 3.094 ± 0.446
GluLys: 3.339 ± 0.641
GluLeu: 6.107 ± 0.84
GluMet: 1.71 ± 0.527
GluAsn: 1.873 ± 0.338
GluPro: 2.606 ± 0.661
GluGln: 2.932 ± 0.645
GluArg: 4.153 ± 0.757
GluSer: 4.072 ± 0.624
GluThr: 3.339 ± 0.437
GluVal: 4.56 ± 0.651
GluTrp: 0.57 ± 0.236
GluTyr: 3.339 ± 0.443
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.687 ± 0.475
PheCys: 0.326 ± 0.174
PheAsp: 2.932 ± 0.543
PheGlu: 2.036 ± 0.345
PhePhe: 0.977 ± 0.262
PheGly: 3.257 ± 0.573
PheHis: 0.651 ± 0.252
PheIle: 2.036 ± 0.458
PheLys: 2.524 ± 0.368
PheLeu: 2.932 ± 0.507
PheMet: 0.977 ± 0.255
PheAsn: 2.036 ± 0.355
PhePro: 1.384 ± 0.452
PheGln: 1.303 ± 0.287
PheArg: 2.036 ± 0.481
PheSer: 2.036 ± 0.406
PheThr: 2.443 ± 0.376
PheVal: 2.362 ± 0.441
PheTrp: 0.326 ± 0.162
PheTyr: 1.221 ± 0.297
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.678 ± 0.943
GlyCys: 0.814 ± 0.235
GlyAsp: 5.537 ± 0.499
GlyGlu: 5.863 ± 0.627
GlyPhe: 2.85 ± 0.346
GlyGly: 6.596 ± 0.855
GlyHis: 1.221 ± 0.326
GlyIle: 4.642 ± 0.848
GlyLys: 5.13 ± 0.725
GlyLeu: 5.7 ± 0.735
GlyMet: 1.629 ± 0.426
GlyAsn: 3.257 ± 0.506
GlyPro: 1.466 ± 0.425
GlyGln: 2.769 ± 0.451
GlyArg: 4.397 ± 0.351
GlySer: 6.189 ± 0.645
GlyThr: 5.375 ± 0.911
GlyVal: 5.212 ± 0.815
GlyTrp: 1.71 ± 0.391
GlyTyr: 2.85 ± 0.482
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.059 ± 0.268
HisCys: 0.407 ± 0.146
HisAsp: 1.466 ± 0.304
HisGlu: 1.384 ± 0.381
HisPhe: 0.814 ± 0.275
HisGly: 1.466 ± 0.375
HisHis: 0.407 ± 0.161
HisIle: 0.814 ± 0.211
HisLys: 1.14 ± 0.308
HisLeu: 1.547 ± 0.384
HisMet: 0.651 ± 0.202
HisAsn: 0.326 ± 0.145
HisPro: 0.814 ± 0.225
HisGln: 0.407 ± 0.14
HisArg: 0.489 ± 0.148
HisSer: 0.814 ± 0.203
HisThr: 0.896 ± 0.24
HisVal: 1.792 ± 0.343
HisTrp: 0.244 ± 0.122
HisTyr: 1.059 ± 0.198
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.479 ± 0.481
IleCys: 0.57 ± 0.171
IleAsp: 3.339 ± 0.443
IleGlu: 3.013 ± 0.581
IlePhe: 0.896 ± 0.258
IleGly: 3.827 ± 0.476
IleHis: 1.14 ± 0.304
IleIle: 2.199 ± 0.535
IleLys: 2.932 ± 0.5
IleLeu: 3.583 ± 0.455
IleMet: 1.14 ± 0.309
IleAsn: 1.873 ± 0.501
IlePro: 2.443 ± 0.431
IleGln: 1.954 ± 0.395
IleArg: 3.746 ± 0.599
IleSer: 3.094 ± 0.458
IleThr: 2.524 ± 0.466
IleVal: 3.176 ± 0.435
IleTrp: 0.407 ± 0.181
IleTyr: 1.792 ± 0.376
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.329 ± 0.887
LysCys: 0.407 ± 0.212
LysAsp: 3.339 ± 0.557
LysGlu: 5.049 ± 0.58
LysPhe: 2.362 ± 0.423
LysGly: 6.433 ± 0.828
LysHis: 1.547 ± 0.345
LysIle: 2.524 ± 0.404
LysLys: 3.176 ± 0.659
LysLeu: 5.7 ± 0.635
LysMet: 1.71 ± 0.365
LysAsn: 2.606 ± 0.434
LysPro: 2.524 ± 0.519
LysGln: 2.28 ± 0.384
LysArg: 3.583 ± 0.62
LysSer: 3.583 ± 0.548
LysThr: 2.932 ± 0.425
LysVal: 5.212 ± 0.683
LysTrp: 0.814 ± 0.268
LysTyr: 1.792 ± 0.376
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.98 ± 1.032
LeuCys: 0.326 ± 0.167
LeuAsp: 4.56 ± 0.589
LeuGlu: 7.166 ± 1.165
LeuPhe: 2.769 ± 0.414
LeuGly: 5.375 ± 0.653
LeuHis: 1.221 ± 0.341
LeuIle: 3.42 ± 0.504
LeuLys: 6.596 ± 0.765
LeuLeu: 6.352 ± 0.913
LeuMet: 2.117 ± 0.298
LeuAsn: 4.072 ± 0.451
LeuPro: 3.013 ± 0.461
LeuGln: 3.909 ± 0.511
LeuArg: 5.293 ± 0.686
LeuSer: 4.56 ± 0.546
LeuThr: 5.049 ± 0.568
LeuVal: 5.13 ± 0.547
LeuTrp: 1.303 ± 0.352
LeuTyr: 2.443 ± 0.423
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.339 ± 0.474
MetCys: 0.163 ± 0.12
MetAsp: 1.792 ± 0.391
MetGlu: 1.629 ± 0.324
MetPhe: 0.977 ± 0.292
MetGly: 1.547 ± 0.324
MetHis: 0.407 ± 0.225
MetIle: 1.303 ± 0.323
MetLys: 1.221 ± 0.301
MetLeu: 2.932 ± 0.562
MetMet: 0.489 ± 0.245
MetAsn: 0.814 ± 0.268
MetPro: 0.733 ± 0.231
MetGln: 2.117 ± 0.388
MetArg: 1.059 ± 0.249
MetSer: 1.466 ± 0.311
MetThr: 2.036 ± 0.408
MetVal: 1.873 ± 0.441
MetTrp: 0.0 ± 0.0
MetTyr: 0.57 ± 0.19
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.153 ± 0.6
AsnCys: 0.489 ± 0.189
AsnAsp: 2.036 ± 0.345
AsnGlu: 3.013 ± 0.495
AsnPhe: 1.466 ± 0.366
AsnGly: 3.909 ± 0.704
AsnHis: 0.57 ± 0.241
AsnIle: 2.524 ± 0.449
AsnLys: 2.036 ± 0.401
AsnLeu: 3.176 ± 0.442
AsnMet: 1.221 ± 0.298
AsnAsn: 1.547 ± 0.317
AsnPro: 2.199 ± 0.373
AsnGln: 1.303 ± 0.26
AsnArg: 1.792 ± 0.478
AsnSer: 2.85 ± 0.55
AsnThr: 2.28 ± 0.43
AsnVal: 2.85 ± 0.473
AsnTrp: 0.814 ± 0.303
AsnTyr: 1.792 ± 0.353
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.094 ± 0.448
ProCys: 0.407 ± 0.202
ProAsp: 2.199 ± 0.4
ProGlu: 4.235 ± 0.602
ProPhe: 1.384 ± 0.296
ProGly: 2.199 ± 0.338
ProHis: 0.407 ± 0.148
ProIle: 1.221 ± 0.326
ProLys: 2.524 ± 0.481
ProLeu: 2.524 ± 0.412
ProMet: 0.651 ± 0.208
ProAsn: 1.873 ± 0.448
ProPro: 1.14 ± 0.326
ProGln: 1.303 ± 0.263
ProArg: 1.792 ± 0.451
ProSer: 2.199 ± 0.311
ProThr: 2.036 ± 0.443
ProVal: 2.606 ± 0.363
ProTrp: 0.814 ± 0.203
ProTyr: 1.873 ± 0.419
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.827 ± 0.494
GlnCys: 0.163 ± 0.116
GlnAsp: 2.606 ± 0.331
GlnGlu: 2.85 ± 0.419
GlnPhe: 1.954 ± 0.351
GlnGly: 3.013 ± 0.524
GlnHis: 0.244 ± 0.178
GlnIle: 1.873 ± 0.409
GlnLys: 3.094 ± 0.478
GlnLeu: 3.99 ± 0.587
GlnMet: 1.384 ± 0.451
GlnAsn: 1.221 ± 0.281
GlnPro: 1.792 ± 0.233
GlnGln: 3.664 ± 0.557
GlnArg: 2.117 ± 0.44
GlnSer: 2.443 ± 0.466
GlnThr: 1.954 ± 0.437
GlnVal: 2.932 ± 0.425
GlnTrp: 0.814 ± 0.207
GlnTyr: 1.629 ± 0.418
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.13 ± 0.848
ArgCys: 0.896 ± 0.281
ArgAsp: 3.664 ± 0.541
ArgGlu: 3.827 ± 0.538
ArgPhe: 1.792 ± 0.351
ArgGly: 3.827 ± 0.538
ArgHis: 0.896 ± 0.291
ArgIle: 3.42 ± 0.595
ArgLys: 3.746 ± 0.477
ArgLeu: 4.479 ± 0.55
ArgMet: 1.547 ± 0.258
ArgAsn: 2.443 ± 0.455
ArgPro: 2.117 ± 0.337
ArgGln: 3.094 ± 0.487
ArgArg: 2.524 ± 0.387
ArgSer: 3.502 ± 0.375
ArgThr: 2.932 ± 0.513
ArgVal: 3.664 ± 0.647
ArgTrp: 1.14 ± 0.296
ArgTyr: 1.14 ± 0.251
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.316 ± 0.631
SerCys: 0.407 ± 0.179
SerAsp: 4.153 ± 0.5
SerGlu: 3.827 ± 0.541
SerPhe: 3.176 ± 0.416
SerGly: 4.642 ± 0.582
SerHis: 1.792 ± 0.336
SerIle: 3.257 ± 0.525
SerLys: 4.072 ± 0.501
SerLeu: 5.13 ± 0.775
SerMet: 1.303 ± 0.343
SerAsn: 1.954 ± 0.528
SerPro: 1.873 ± 0.341
SerGln: 3.094 ± 0.488
SerArg: 3.339 ± 0.505
SerSer: 2.28 ± 0.495
SerThr: 3.99 ± 0.628
SerVal: 4.072 ± 0.537
SerTrp: 0.733 ± 0.17
SerTyr: 2.117 ± 0.487
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.805 ± 0.729
ThrCys: 0.733 ± 0.245
ThrAsp: 3.176 ± 0.445
ThrGlu: 3.339 ± 0.505
ThrPhe: 1.954 ± 0.378
ThrGly: 5.293 ± 0.733
ThrHis: 1.14 ± 0.251
ThrIle: 3.257 ± 0.454
ThrLys: 4.642 ± 0.607
ThrLeu: 5.456 ± 0.648
ThrMet: 1.466 ± 0.323
ThrAsn: 2.362 ± 0.51
ThrPro: 2.606 ± 0.371
ThrGln: 2.28 ± 0.427
ThrArg: 2.443 ± 0.368
ThrSer: 3.746 ± 0.726
ThrThr: 3.339 ± 0.755
ThrVal: 3.99 ± 0.617
ThrTrp: 0.57 ± 0.181
ThrTyr: 1.547 ± 0.303
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.456 ± 0.526
ValCys: 0.651 ± 0.208
ValAsp: 3.339 ± 0.379
ValGlu: 3.99 ± 0.563
ValPhe: 2.28 ± 0.486
ValGly: 4.723 ± 0.505
ValHis: 1.059 ± 0.325
ValIle: 3.257 ± 0.6
ValLys: 4.56 ± 0.628
ValLeu: 5.619 ± 0.748
ValMet: 1.629 ± 0.322
ValAsn: 4.072 ± 0.678
ValPro: 2.199 ± 0.496
ValGln: 2.443 ± 0.341
ValArg: 4.479 ± 0.546
ValSer: 4.886 ± 0.875
ValThr: 5.863 ± 0.813
ValVal: 4.642 ± 0.703
ValTrp: 0.733 ± 0.262
ValTyr: 2.28 ± 0.526
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.651 ± 0.201
TrpCys: 0.326 ± 0.149
TrpAsp: 0.651 ± 0.207
TrpGlu: 1.221 ± 0.215
TrpPhe: 0.407 ± 0.198
TrpGly: 0.489 ± 0.226
TrpHis: 0.407 ± 0.204
TrpIle: 0.57 ± 0.269
TrpLys: 1.303 ± 0.341
TrpLeu: 1.384 ± 0.315
TrpMet: 0.244 ± 0.145
TrpAsn: 0.896 ± 0.26
TrpPro: 0.244 ± 0.135
TrpGln: 0.651 ± 0.251
TrpArg: 0.814 ± 0.242
TrpSer: 1.221 ± 0.343
TrpThr: 0.733 ± 0.226
TrpVal: 1.384 ± 0.368
TrpTrp: 0.163 ± 0.118
TrpTyr: 0.244 ± 0.13
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.199 ± 0.407
TyrCys: 0.163 ± 0.123
TyrAsp: 2.524 ± 0.493
TyrGlu: 2.362 ± 0.451
TyrPhe: 1.059 ± 0.232
TyrGly: 3.257 ± 0.368
TyrHis: 0.896 ± 0.255
TyrIle: 1.466 ± 0.356
TyrLys: 1.71 ± 0.28
TyrLeu: 2.524 ± 0.47
TyrMet: 1.466 ± 0.359
TyrAsn: 1.792 ± 0.486
TyrPro: 1.303 ± 0.301
TyrGln: 1.71 ± 0.489
TyrArg: 2.524 ± 0.501
TyrSer: 1.14 ± 0.342
TyrThr: 2.443 ± 0.564
TyrVal: 2.443 ± 0.481
TyrTrp: 0.57 ± 0.213
TyrTyr: 0.733 ± 0.299
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 46 proteins (12281 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
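One plausible reading of "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement 100 times, the frequency is recomputed for each replicate, and the spread of the replicates is reported. Taking the sample standard deviation as the error, the function names, and the fixed seed are all assumptions for this example; the note does not say which statistic the database reports.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome, not sequences from this phage.
toy = ["MKAAL", "AAK", "MLLK", "GAAG"]
error = bootstrap_error(toy, "AA")
```

Resampling proteins rather than individual residues keeps each protein's internal pair structure intact, so the error reflects variation between proteins.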

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski