Amino acid dipeptide frequency for Klebsiella phage VLC4

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 amino acids equally likely, each dipeptide would be present at a frequency of 0.25% (2.5‰). Since this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
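The comparison against the uniform expectation can be sketched in Python as follows. This is only an illustration under stated assumptions, not the Proteome-pI implementation: the names `dipeptide_permille` and `representation` are hypothetical, dipeptides are counted as overlapping pairs within each protein, and frequencies are normalized by the total number of dipeptides counted (the database may normalize differently).

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes of the 20 standard amino acids

# Uniform expectation: 1 of 400 dipeptides, i.e. 2.5 per mille (0.25 %)
EXPECTED_PERMILLE = 1000.0 / len(STANDARD) ** 2


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted within each protein only,
    and any letter outside the 20 standard codes is mapped to X (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def representation(freq_permille):
    """Classify a frequency relative to the uniform expectation."""
    if freq_permille > EXPECTED_PERMILLE:
        return "over"   # would be marked red in the table
    if freq_permille < EXPECTED_PERMILLE:
        return "under"  # would be marked blue in the table
    return "expected"
```

With this sketch, `representation(14.8)` classifies the AlaAla value from the table as overrepresented, and `representation(0.783)` classifies AlaCys as underrepresented.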

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions; for example, AlaAla at 14.8‰ corresponds to 0.0148 (1.48%).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 14.8 ± 1.327
AlaCys: 0.783 ± 0.238
AlaAsp: 5.835 ± 0.69
AlaGlu: 5.123 ± 0.456
AlaPhe: 2.917 ± 0.376
AlaGly: 8.681 ± 1.202
AlaHis: 1.708 ± 0.412
AlaIle: 4.412 ± 0.569
AlaLys: 5.621 ± 1.11
AlaLeu: 8.681 ± 0.93
AlaMet: 2.775 ± 0.377
AlaAsn: 3.487 ± 0.429
AlaPro: 4.412 ± 1.088
AlaGln: 5.123 ± 0.939
AlaArg: 5.835 ± 0.933
AlaSer: 5.692 ± 0.808
AlaThr: 5.479 ± 0.756
AlaVal: 8.183 ± 0.873
AlaTrp: 1.138 ± 0.331
AlaTyr: 4.269 ± 0.537
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.21 ± 0.321
CysCys: 0.356 ± 0.224
CysAsp: 0.64 ± 0.226
CysGlu: 0.427 ± 0.166
CysPhe: 0.285 ± 0.174
CysGly: 0.783 ± 0.296
CysHis: 0.498 ± 0.183
CysIle: 0.498 ± 0.22
CysLys: 0.569 ± 0.227
CysLeu: 0.783 ± 0.225
CysMet: 0.569 ± 0.204
CysAsn: 0.498 ± 0.208
CysPro: 0.569 ± 0.245
CysGln: 0.285 ± 0.152
CysArg: 0.783 ± 0.226
CysSer: 1.138 ± 0.31
CysThr: 1.138 ± 0.311
CysVal: 1.067 ± 0.291
CysTrp: 0.071 ± 0.08
CysTyr: 0.498 ± 0.183
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.685 ± 0.912
AspCys: 1.067 ± 0.325
AspAsp: 3.273 ± 0.444
AspGlu: 3.131 ± 0.473
AspPhe: 1.921 ± 0.39
AspGly: 4.696 ± 0.514
AspHis: 0.854 ± 0.297
AspIle: 2.988 ± 0.358
AspLys: 2.704 ± 0.458
AspLeu: 5.265 ± 0.61
AspMet: 2.562 ± 0.401
AspAsn: 2.917 ± 0.439
AspPro: 2.277 ± 0.34
AspGln: 1.494 ± 0.316
AspArg: 2.704 ± 0.526
AspSer: 4.981 ± 0.561
AspThr: 4.269 ± 0.596
AspVal: 3.7 ± 0.49
AspTrp: 1.21 ± 0.233
AspTyr: 2.419 ± 0.452
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.692 ± 0.787
GluCys: 0.569 ± 0.223
GluAsp: 3.06 ± 0.429
GluGlu: 4.554 ± 0.792
GluPhe: 2.419 ± 0.373
GluGly: 3.842 ± 0.568
GluHis: 2.49 ± 0.492
GluIle: 1.921 ± 0.36
GluLys: 1.921 ± 0.4
GluLeu: 5.763 ± 0.82
GluMet: 2.419 ± 0.397
GluAsn: 1.779 ± 0.351
GluPro: 1.637 ± 0.342
GluGln: 3.344 ± 0.589
GluArg: 3.415 ± 0.475
GluSer: 2.49 ± 0.513
GluThr: 2.49 ± 0.411
GluVal: 5.265 ± 0.496
GluTrp: 1.21 ± 0.242
GluTyr: 2.348 ± 0.359
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.917 ± 0.36
PheCys: 0.427 ± 0.221
PheAsp: 2.135 ± 0.397
PheGlu: 2.063 ± 0.389
PhePhe: 1.067 ± 0.225
PheGly: 2.277 ± 0.326
PheHis: 0.285 ± 0.121
PheIle: 1.352 ± 0.328
PheLys: 1.779 ± 0.413
PheLeu: 2.135 ± 0.343
PheMet: 0.569 ± 0.253
PheAsn: 1.352 ± 0.308
PhePro: 1.494 ± 0.292
PheGln: 1.138 ± 0.235
PheArg: 1.494 ± 0.386
PheSer: 1.565 ± 0.361
PheThr: 2.348 ± 0.514
PheVal: 1.85 ± 0.442
PheTrp: 0.427 ± 0.133
PheTyr: 1.565 ± 0.313
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.475 ± 0.899
GlyCys: 1.494 ± 0.383
GlyAsp: 4.696 ± 0.621
GlyGlu: 3.415 ± 0.426
GlyPhe: 2.704 ± 0.512
GlyGly: 4.269 ± 0.592
GlyHis: 1.138 ± 0.266
GlyIle: 4.412 ± 0.684
GlyLys: 4.412 ± 0.661
GlyLeu: 6.831 ± 0.622
GlyMet: 1.921 ± 0.484
GlyAsn: 3.7 ± 0.63
GlyPro: 1.85 ± 0.356
GlyGln: 2.704 ± 0.394
GlyArg: 4.554 ± 0.482
GlySer: 6.19 ± 0.619
GlyThr: 4.412 ± 0.652
GlyVal: 5.479 ± 0.664
GlyTrp: 0.783 ± 0.231
GlyTyr: 2.988 ± 0.504
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.565 ± 0.376
HisCys: 0.285 ± 0.129
HisAsp: 0.996 ± 0.259
HisGlu: 1.423 ± 0.379
HisPhe: 0.285 ± 0.138
HisGly: 1.921 ± 0.429
HisHis: 0.285 ± 0.167
HisIle: 1.423 ± 0.328
HisLys: 1.138 ± 0.279
HisLeu: 2.49 ± 0.487
HisMet: 0.427 ± 0.146
HisAsn: 0.712 ± 0.225
HisPro: 0.854 ± 0.342
HisGln: 0.569 ± 0.254
HisArg: 1.85 ± 0.323
HisSer: 0.996 ± 0.243
HisThr: 0.712 ± 0.217
HisVal: 0.783 ± 0.249
HisTrp: 0.213 ± 0.136
HisTyr: 0.925 ± 0.256
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.344 ± 0.563
IleCys: 0.356 ± 0.171
IleAsp: 3.202 ± 0.387
IleGlu: 2.846 ± 0.512
IlePhe: 0.712 ± 0.202
IleGly: 2.633 ± 0.362
IleHis: 0.783 ± 0.198
IleIle: 1.921 ± 0.331
IleLys: 2.988 ± 0.469
IleLeu: 4.767 ± 0.628
IleMet: 1.138 ± 0.254
IleAsn: 1.708 ± 0.318
IlePro: 2.277 ± 0.414
IleGln: 2.49 ± 0.491
IleArg: 2.917 ± 0.412
IleSer: 3.487 ± 0.519
IleThr: 2.846 ± 0.578
IleVal: 2.562 ± 0.369
IleTrp: 0.285 ± 0.159
IleTyr: 1.423 ± 0.291
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.763 ± 0.901
LysCys: 0.427 ± 0.208
LysAsp: 2.704 ± 0.448
LysGlu: 3.629 ± 0.677
LysPhe: 1.423 ± 0.36
LysGly: 3.344 ± 0.575
LysHis: 1.138 ± 0.309
LysIle: 1.067 ± 0.266
LysLys: 2.135 ± 0.429
LysLeu: 4.625 ± 0.628
LysMet: 1.352 ± 0.312
LysAsn: 1.494 ± 0.273
LysPro: 1.637 ± 0.365
LysGln: 3.06 ± 0.432
LysArg: 3.415 ± 0.467
LysSer: 2.846 ± 0.422
LysThr: 2.704 ± 0.357
LysVal: 3.131 ± 0.403
LysTrp: 1.067 ± 0.267
LysTyr: 1.708 ± 0.405
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.325 ± 0.78
LeuCys: 1.281 ± 0.285
LeuAsp: 6.902 ± 0.589
LeuGlu: 5.692 ± 0.612
LeuPhe: 2.49 ± 0.354
LeuGly: 6.76 ± 0.648
LeuHis: 1.423 ± 0.289
LeuIle: 4.91 ± 0.666
LeuLys: 3.202 ± 0.555
LeuLeu: 6.333 ± 0.698
LeuMet: 2.348 ± 0.39
LeuAsn: 3.487 ± 0.478
LeuPro: 3.273 ± 0.459
LeuGln: 3.842 ± 0.63
LeuArg: 6.617 ± 0.619
LeuSer: 4.91 ± 0.575
LeuThr: 5.123 ± 0.653
LeuVal: 6.333 ± 0.758
LeuTrp: 1.138 ± 0.278
LeuTyr: 3.202 ± 0.411
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.487 ± 0.605
MetCys: 0.356 ± 0.172
MetAsp: 1.85 ± 0.47
MetGlu: 0.854 ± 0.227
MetPhe: 0.783 ± 0.303
MetGly: 1.423 ± 0.211
MetHis: 0.854 ± 0.234
MetIle: 0.712 ± 0.173
MetLys: 1.21 ± 0.29
MetLeu: 3.344 ± 0.547
MetMet: 0.498 ± 0.232
MetAsn: 0.925 ± 0.277
MetPro: 0.996 ± 0.242
MetGln: 1.992 ± 0.405
MetArg: 1.921 ± 0.487
MetSer: 2.277 ± 0.572
MetThr: 0.925 ± 0.291
MetVal: 2.348 ± 0.44
MetTrp: 0.498 ± 0.195
MetTyr: 1.21 ± 0.33
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.131 ± 0.445
AsnCys: 0.64 ± 0.222
AsnAsp: 2.348 ± 0.388
AsnGlu: 1.352 ± 0.359
AsnPhe: 0.996 ± 0.281
AsnGly: 3.344 ± 0.546
AsnHis: 0.213 ± 0.115
AsnIle: 2.348 ± 0.414
AsnLys: 2.206 ± 0.331
AsnLeu: 3.131 ± 0.466
AsnMet: 1.352 ± 0.279
AsnAsn: 1.494 ± 0.391
AsnPro: 2.562 ± 0.461
AsnGln: 1.637 ± 0.421
AsnArg: 1.85 ± 0.409
AsnSer: 3.273 ± 0.594
AsnThr: 2.846 ± 0.492
AsnVal: 2.917 ± 0.388
AsnTrp: 0.64 ± 0.224
AsnTyr: 1.352 ± 0.323
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.198 ± 0.763
ProCys: 0.213 ± 0.114
ProAsp: 2.562 ± 0.463
ProGlu: 3.415 ± 0.579
ProPhe: 1.21 ± 0.316
ProGly: 2.562 ± 0.522
ProHis: 0.64 ± 0.262
ProIle: 2.063 ± 0.428
ProLys: 1.779 ± 0.394
ProLeu: 2.704 ± 0.375
ProMet: 0.996 ± 0.266
ProAsn: 1.067 ± 0.328
ProPro: 0.712 ± 0.209
ProGln: 1.423 ± 0.26
ProArg: 1.779 ± 0.305
ProSer: 2.419 ± 0.572
ProThr: 2.419 ± 0.553
ProVal: 2.917 ± 0.506
ProTrp: 0.569 ± 0.192
ProTyr: 1.708 ± 0.328
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.696 ± 0.787
GlnCys: 0.427 ± 0.184
GlnAsp: 3.131 ± 0.443
GlnGlu: 3.344 ± 0.473
GlnPhe: 1.281 ± 0.368
GlnGly: 2.704 ± 0.393
GlnHis: 1.281 ± 0.278
GlnIle: 0.854 ± 0.299
GlnLys: 2.277 ± 0.463
GlnLeu: 4.838 ± 0.529
GlnMet: 1.138 ± 0.259
GlnAsn: 2.063 ± 0.321
GlnPro: 1.494 ± 0.42
GlnGln: 2.917 ± 0.569
GlnArg: 2.988 ± 0.402
GlnSer: 2.633 ± 0.512
GlnThr: 1.565 ± 0.373
GlnVal: 2.562 ± 0.435
GlnTrp: 0.64 ± 0.21
GlnTyr: 2.277 ± 0.467
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.044 ± 1.182
ArgCys: 0.498 ± 0.218
ArgAsp: 3.629 ± 0.508
ArgGlu: 3.344 ± 0.385
ArgPhe: 2.277 ± 0.343
ArgGly: 4.412 ± 0.919
ArgHis: 1.352 ± 0.286
ArgIle: 2.917 ± 0.547
ArgLys: 3.415 ± 0.545
ArgLeu: 5.265 ± 0.5
ArgMet: 1.921 ± 0.409
ArgAsn: 2.775 ± 0.399
ArgPro: 1.352 ± 0.314
ArgGln: 2.49 ± 0.455
ArgArg: 4.483 ± 0.724
ArgSer: 2.633 ± 0.502
ArgThr: 3.558 ± 0.447
ArgVal: 3.629 ± 0.463
ArgTrp: 0.925 ± 0.232
ArgTyr: 2.49 ± 0.367
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.898 ± 0.927
SerCys: 0.712 ± 0.254
SerAsp: 4.127 ± 0.55
SerGlu: 3.415 ± 0.493
SerPhe: 2.562 ± 0.422
SerGly: 6.19 ± 0.967
SerHis: 0.569 ± 0.209
SerIle: 2.988 ± 0.536
SerLys: 3.558 ± 0.593
SerLeu: 4.412 ± 0.585
SerMet: 2.49 ± 0.439
SerAsn: 2.775 ± 0.523
SerPro: 2.49 ± 0.427
SerGln: 1.494 ± 0.334
SerArg: 3.06 ± 0.474
SerSer: 4.91 ± 1.069
SerThr: 3.842 ± 0.582
SerVal: 4.483 ± 0.509
SerTrp: 0.996 ± 0.245
SerTyr: 1.708 ± 0.399
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.692 ± 0.757
ThrCys: 0.498 ± 0.268
ThrAsp: 2.846 ± 0.451
ThrGlu: 2.775 ± 0.66
ThrPhe: 1.565 ± 0.353
ThrGly: 4.981 ± 0.631
ThrHis: 1.423 ± 0.303
ThrIle: 2.49 ± 0.399
ThrLys: 2.49 ± 0.474
ThrLeu: 5.479 ± 0.602
ThrMet: 1.138 ± 0.33
ThrAsn: 1.708 ± 0.411
ThrPro: 2.704 ± 0.346
ThrGln: 2.704 ± 0.411
ThrArg: 2.419 ± 0.465
ThrSer: 4.625 ± 0.696
ThrThr: 2.633 ± 0.452
ThrVal: 4.554 ± 0.685
ThrTrp: 0.996 ± 0.21
ThrTyr: 2.277 ± 0.456
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.617 ± 0.962
ValCys: 0.712 ± 0.214
ValAsp: 5.265 ± 0.575
ValGlu: 4.554 ± 0.62
ValPhe: 1.281 ± 0.255
ValGly: 5.977 ± 0.636
ValHis: 1.85 ± 0.353
ValIle: 3.06 ± 0.438
ValLys: 2.846 ± 0.518
ValLeu: 5.479 ± 0.828
ValMet: 1.85 ± 0.367
ValAsn: 3.202 ± 0.562
ValPro: 2.704 ± 0.438
ValGln: 3.771 ± 0.607
ValArg: 4.696 ± 0.524
ValSer: 4.838 ± 0.666
ValThr: 2.846 ± 0.63
ValVal: 5.692 ± 0.793
ValTrp: 0.712 ± 0.253
ValTyr: 2.704 ± 0.404
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.138 ± 0.289
TrpCys: 0.498 ± 0.167
TrpAsp: 0.64 ± 0.183
TrpGlu: 1.067 ± 0.276
TrpPhe: 0.569 ± 0.266
TrpGly: 0.712 ± 0.266
TrpHis: 0.285 ± 0.122
TrpIle: 0.498 ± 0.179
TrpLys: 0.427 ± 0.214
TrpLeu: 1.494 ± 0.27
TrpMet: 0.356 ± 0.192
TrpAsn: 0.854 ± 0.205
TrpPro: 0.427 ± 0.182
TrpGln: 0.498 ± 0.158
TrpArg: 0.996 ± 0.24
TrpSer: 0.64 ± 0.288
TrpThr: 1.138 ± 0.24
TrpVal: 1.21 ± 0.254
TrpTrp: 0.64 ± 0.214
TrpTyr: 0.783 ± 0.237
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.273 ± 0.479
TyrCys: 1.067 ± 0.314
TyrAsp: 2.277 ± 0.398
TyrGlu: 2.277 ± 0.453
TyrPhe: 1.494 ± 0.312
TyrGly: 2.917 ± 0.469
TyrHis: 0.783 ± 0.224
TyrIle: 1.85 ± 0.436
TyrLys: 1.992 ± 0.387
TyrLeu: 3.771 ± 0.381
TyrMet: 0.569 ± 0.238
TyrAsn: 1.637 ± 0.263
TyrPro: 1.637 ± 0.285
TyrGln: 2.135 ± 0.444
TyrArg: 2.633 ± 0.424
TyrSer: 2.135 ± 0.289
TyrThr: 2.704 ± 0.484
TyrVal: 1.992 ± 0.476
TyrTrp: 0.712 ± 0.23
TyrTyr: 1.138 ± 0.308
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 62 proteins (14055 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
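A protein-level bootstrap of this kind can be sketched as below. It is a minimal illustration, not the code used by Proteome-pI: the function name `bootstrap_sd`, the fixed seed, the use of the sample standard deviation as the reported error, and the normalization by the number of dipeptides in each resample are all assumptions.

```python
import random
import statistics
from collections import Counter


def bootstrap_sd(proteins, dipeptide, n_resamples=100, seed=0):
    """Estimate the spread (in per mille) of one dipeptide's frequency.

    Whole proteins are resampled with replacement, so each resample
    contains as many proteins as the original proteome.
    """
    rng = random.Random(seed)
    # Count overlapping dipeptides once per protein, then resample the counts
    per_protein = [
        Counter(seq[i:i + 2] for i in range(len(seq) - 1)) for seq in proteins
    ]
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Sample standard deviation across the bootstrap replicates
    return statistics.stdev(estimates)
```

Resampling whole proteins rather than individual residues keeps the dipeptides of each protein together, so the estimate reflects variation between proteins in the proteome.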

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski