Amino acid dipepetide frequency for Klebsiella phage K1-ULIP33

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.975AlaAla: 9.975 ± 1.356
0.596AlaCys: 0.596 ± 0.236
4.987AlaAsp: 4.987 ± 0.685
7.593AlaGlu: 7.593 ± 1.043
3.945AlaPhe: 3.945 ± 0.7
7.667AlaGly: 7.667 ± 1.106
2.01AlaHis: 2.01 ± 0.54
4.838AlaIle: 4.838 ± 0.605
5.732AlaLys: 5.732 ± 0.736
7.22AlaLeu: 7.22 ± 0.623
3.499AlaMet: 3.499 ± 0.604
3.052AlaAsn: 3.052 ± 0.474
2.903AlaPro: 2.903 ± 0.566
4.913AlaGln: 4.913 ± 0.643
5.508AlaArg: 5.508 ± 0.654
4.838AlaSer: 4.838 ± 0.624
4.466AlaThr: 4.466 ± 0.581
7.593AlaVal: 7.593 ± 0.886
1.265AlaTrp: 1.265 ± 0.303
2.605AlaTyr: 2.605 ± 0.331
0.0AlaXaa: 0.0 ± 0.0
Cys
0.596CysAla: 0.596 ± 0.204
0.149CysCys: 0.149 ± 0.091
0.744CysAsp: 0.744 ± 0.264
0.67CysGlu: 0.67 ± 0.234
0.447CysPhe: 0.447 ± 0.228
0.744CysGly: 0.744 ± 0.264
0.223CysHis: 0.223 ± 0.146
0.596CysIle: 0.596 ± 0.25
0.893CysLys: 0.893 ± 0.31
0.893CysLeu: 0.893 ± 0.282
0.447CysMet: 0.447 ± 0.194
0.372CysAsn: 0.372 ± 0.149
0.447CysPro: 0.447 ± 0.18
0.521CysGln: 0.521 ± 0.218
0.819CysArg: 0.819 ± 0.256
0.298CysSer: 0.298 ± 0.13
0.298CysThr: 0.298 ± 0.149
0.744CysVal: 0.744 ± 0.277
0.074CysTrp: 0.074 ± 0.067
0.596CysTyr: 0.596 ± 0.261
0.0CysXaa: 0.0 ± 0.0
Asp
7.295AspAla: 7.295 ± 0.737
0.447AspCys: 0.447 ± 0.262
3.647AspAsp: 3.647 ± 0.554
4.764AspGlu: 4.764 ± 0.468
1.935AspPhe: 1.935 ± 0.335
5.434AspGly: 5.434 ± 0.895
0.893AspHis: 0.893 ± 0.354
3.722AspIle: 3.722 ± 0.503
3.722AspLys: 3.722 ± 0.467
4.615AspLeu: 4.615 ± 0.601
1.935AspMet: 1.935 ± 0.253
2.382AspAsn: 2.382 ± 0.45
1.787AspPro: 1.787 ± 0.316
1.34AspGln: 1.34 ± 0.283
3.052AspArg: 3.052 ± 0.449
3.424AspSer: 3.424 ± 0.553
3.796AspThr: 3.796 ± 0.584
4.615AspVal: 4.615 ± 0.57
1.191AspTrp: 1.191 ± 0.393
1.712AspTyr: 1.712 ± 0.397
0.0AspXaa: 0.0 ± 0.0
Glu
8.337GluAla: 8.337 ± 0.883
0.968GluCys: 0.968 ± 0.286
4.838GluAsp: 4.838 ± 0.477
6.551GluGlu: 6.551 ± 1.089
3.052GluPhe: 3.052 ± 0.522
6.029GluGly: 6.029 ± 0.582
1.489GluHis: 1.489 ± 0.307
3.275GluIle: 3.275 ± 0.498
3.796GluLys: 3.796 ± 0.505
6.104GluLeu: 6.104 ± 0.971
2.456GluMet: 2.456 ± 0.435
2.382GluAsn: 2.382 ± 0.484
1.861GluPro: 1.861 ± 0.35
2.754GluGln: 2.754 ± 0.411
3.722GluArg: 3.722 ± 0.486
2.978GluSer: 2.978 ± 0.549
2.754GluThr: 2.754 ± 0.492
5.806GluVal: 5.806 ± 0.874
1.563GluTrp: 1.563 ± 0.329
2.01GluTyr: 2.01 ± 0.319
0.0GluXaa: 0.0 ± 0.0
Phe
2.903PheAla: 2.903 ± 0.468
0.447PheCys: 0.447 ± 0.172
3.201PheAsp: 3.201 ± 0.454
2.978PheGlu: 2.978 ± 0.511
1.638PhePhe: 1.638 ± 0.355
2.754PheGly: 2.754 ± 0.579
0.67PheHis: 0.67 ± 0.175
2.382PheIle: 2.382 ± 0.278
2.754PheLys: 2.754 ± 0.439
4.094PheLeu: 4.094 ± 0.478
1.042PheMet: 1.042 ± 0.307
1.712PheAsn: 1.712 ± 0.298
1.042PhePro: 1.042 ± 0.276
0.968PheGln: 0.968 ± 0.217
1.712PheArg: 1.712 ± 0.343
2.233PheSer: 2.233 ± 0.383
2.308PheThr: 2.308 ± 0.369
1.117PheVal: 1.117 ± 0.357
0.372PheTrp: 0.372 ± 0.173
0.819PheTyr: 0.819 ± 0.264
0.0PheXaa: 0.0 ± 0.0
Gly
5.806GlyAla: 5.806 ± 0.505
0.968GlyCys: 0.968 ± 0.308
6.104GlyAsp: 6.104 ± 0.729
5.434GlyGlu: 5.434 ± 0.707
3.35GlyPhe: 3.35 ± 0.739
5.062GlyGly: 5.062 ± 0.69
2.233GlyHis: 2.233 ± 0.329
4.615GlyIle: 4.615 ± 0.514
6.923GlyLys: 6.923 ± 1.039
4.466GlyLeu: 4.466 ± 0.723
1.712GlyMet: 1.712 ± 0.324
3.126GlyAsn: 3.126 ± 0.588
0.521GlyPro: 0.521 ± 0.294
2.978GlyGln: 2.978 ± 0.486
4.615GlyArg: 4.615 ± 0.484
4.392GlySer: 4.392 ± 0.846
4.913GlyThr: 4.913 ± 0.65
4.615GlyVal: 4.615 ± 0.718
1.638GlyTrp: 1.638 ± 0.29
2.978GlyTyr: 2.978 ± 0.424
0.0GlyXaa: 0.0 ± 0.0
His
1.265HisAla: 1.265 ± 0.313
0.223HisCys: 0.223 ± 0.16
1.563HisAsp: 1.563 ± 0.387
1.117HisGlu: 1.117 ± 0.385
1.042HisPhe: 1.042 ± 0.222
1.935HisGly: 1.935 ± 0.478
0.67HisHis: 0.67 ± 0.241
1.34HisIle: 1.34 ± 0.332
1.414HisLys: 1.414 ± 0.43
2.382HisLeu: 2.382 ± 0.423
0.596HisMet: 0.596 ± 0.278
1.117HisAsn: 1.117 ± 0.262
0.819HisPro: 0.819 ± 0.25
0.596HisGln: 0.596 ± 0.222
1.414HisArg: 1.414 ± 0.455
0.744HisSer: 0.744 ± 0.211
0.893HisThr: 0.893 ± 0.25
1.265HisVal: 1.265 ± 0.287
0.223HisTrp: 0.223 ± 0.119
1.265HisTyr: 1.265 ± 0.346
0.0HisXaa: 0.0 ± 0.0
Ile
4.541IleAla: 4.541 ± 0.513
0.447IleCys: 0.447 ± 0.236
3.647IleAsp: 3.647 ± 0.454
2.903IleGlu: 2.903 ± 0.463
1.787IlePhe: 1.787 ± 0.534
3.647IleGly: 3.647 ± 0.484
1.191IleHis: 1.191 ± 0.248
2.903IleIle: 2.903 ± 0.589
4.317IleLys: 4.317 ± 0.495
3.35IleLeu: 3.35 ± 0.454
1.265IleMet: 1.265 ± 0.375
2.308IleAsn: 2.308 ± 0.397
2.159IlePro: 2.159 ± 0.454
2.382IleGln: 2.382 ± 0.445
3.126IleArg: 3.126 ± 0.448
3.499IleSer: 3.499 ± 0.494
2.531IleThr: 2.531 ± 0.487
3.275IleVal: 3.275 ± 0.451
0.596IleTrp: 0.596 ± 0.217
1.712IleTyr: 1.712 ± 0.399
0.0IleXaa: 0.0 ± 0.0
Lys
7.965LysAla: 7.965 ± 0.891
0.67LysCys: 0.67 ± 0.204
3.424LysAsp: 3.424 ± 0.636
4.615LysGlu: 4.615 ± 0.645
2.084LysPhe: 2.084 ± 0.385
5.36LysGly: 5.36 ± 0.658
1.489LysHis: 1.489 ± 0.463
2.01LysIle: 2.01 ± 0.378
3.647LysLys: 3.647 ± 0.75
5.285LysLeu: 5.285 ± 0.677
1.638LysMet: 1.638 ± 0.375
1.861LysAsn: 1.861 ± 0.469
2.68LysPro: 2.68 ± 0.47
2.531LysGln: 2.531 ± 0.389
4.392LysArg: 4.392 ± 0.525
3.871LysSer: 3.871 ± 0.504
2.829LysThr: 2.829 ± 0.41
4.838LysVal: 4.838 ± 0.816
0.968LysTrp: 0.968 ± 0.266
2.084LysTyr: 2.084 ± 0.4
0.0LysXaa: 0.0 ± 0.0
Leu
4.764LeuAla: 4.764 ± 0.811
1.042LeuCys: 1.042 ± 0.317
5.285LeuAsp: 5.285 ± 0.53
5.955LeuGlu: 5.955 ± 0.799
2.308LeuPhe: 2.308 ± 0.584
5.285LeuGly: 5.285 ± 0.566
1.265LeuHis: 1.265 ± 0.344
3.126LeuIle: 3.126 ± 0.451
5.36LeuLys: 5.36 ± 0.588
4.764LeuLeu: 4.764 ± 0.61
2.68LeuMet: 2.68 ± 0.438
2.978LeuAsn: 2.978 ± 0.48
3.424LeuPro: 3.424 ± 0.481
3.796LeuGln: 3.796 ± 0.809
5.136LeuArg: 5.136 ± 0.826
5.434LeuSer: 5.434 ± 0.893
4.541LeuThr: 4.541 ± 0.547
5.806LeuVal: 5.806 ± 0.627
0.67LeuTrp: 0.67 ± 0.242
2.084LeuTyr: 2.084 ± 0.374
0.0LeuXaa: 0.0 ± 0.0
Met
4.317MetAla: 4.317 ± 0.431
0.372MetCys: 0.372 ± 0.141
1.191MetAsp: 1.191 ± 0.282
1.712MetGlu: 1.712 ± 0.353
1.117MetPhe: 1.117 ± 0.276
2.382MetGly: 2.382 ± 0.56
0.819MetHis: 0.819 ± 0.231
1.34MetIle: 1.34 ± 0.401
1.638MetLys: 1.638 ± 0.301
3.275MetLeu: 3.275 ± 0.495
0.521MetMet: 0.521 ± 0.181
1.117MetAsn: 1.117 ± 0.265
1.265MetPro: 1.265 ± 0.213
1.787MetGln: 1.787 ± 0.517
2.01MetArg: 2.01 ± 0.408
2.605MetSer: 2.605 ± 0.381
2.084MetThr: 2.084 ± 0.334
1.638MetVal: 1.638 ± 0.365
0.298MetTrp: 0.298 ± 0.128
0.819MetTyr: 0.819 ± 0.27
0.0MetXaa: 0.0 ± 0.0
Asn
3.35AsnAla: 3.35 ± 0.72
0.447AsnCys: 0.447 ± 0.202
1.712AsnAsp: 1.712 ± 0.326
2.382AsnGlu: 2.382 ± 0.458
1.935AsnPhe: 1.935 ± 0.379
3.052AsnGly: 3.052 ± 0.459
0.744AsnHis: 0.744 ± 0.22
2.233AsnIle: 2.233 ± 0.318
3.126AsnLys: 3.126 ± 0.417
3.126AsnLeu: 3.126 ± 0.46
1.042AsnMet: 1.042 ± 0.22
1.563AsnAsn: 1.563 ± 0.336
1.935AsnPro: 1.935 ± 0.31
1.638AsnGln: 1.638 ± 0.434
2.903AsnArg: 2.903 ± 0.551
1.935AsnSer: 1.935 ± 0.406
2.829AsnThr: 2.829 ± 0.417
3.275AsnVal: 3.275 ± 0.536
0.447AsnTrp: 0.447 ± 0.158
1.117AsnTyr: 1.117 ± 0.297
0.0AsnXaa: 0.0 ± 0.0
Pro
3.871ProAla: 3.871 ± 0.533
0.447ProCys: 0.447 ± 0.157
2.68ProAsp: 2.68 ± 0.461
2.903ProGlu: 2.903 ± 0.482
0.968ProPhe: 0.968 ± 0.286
1.787ProGly: 1.787 ± 0.42
0.67ProHis: 0.67 ± 0.263
1.265ProIle: 1.265 ± 0.324
1.861ProLys: 1.861 ± 0.295
2.233ProLeu: 2.233 ± 0.364
1.34ProMet: 1.34 ± 0.36
1.042ProAsn: 1.042 ± 0.286
1.117ProPro: 1.117 ± 0.24
1.638ProGln: 1.638 ± 0.271
1.265ProArg: 1.265 ± 0.255
2.308ProSer: 2.308 ± 0.333
2.084ProThr: 2.084 ± 0.507
3.052ProVal: 3.052 ± 0.607
0.744ProTrp: 0.744 ± 0.192
1.414ProTyr: 1.414 ± 0.302
0.0ProXaa: 0.0 ± 0.0
Gln
5.211GlnAla: 5.211 ± 0.7
0.298GlnCys: 0.298 ± 0.14
2.159GlnAsp: 2.159 ± 0.499
2.68GlnGlu: 2.68 ± 0.41
1.638GlnPhe: 1.638 ± 0.332
3.573GlnGly: 3.573 ± 0.638
0.893GlnHis: 0.893 ± 0.224
2.456GlnIle: 2.456 ± 0.4
1.563GlnLys: 1.563 ± 0.356
3.126GlnLeu: 3.126 ± 0.433
1.489GlnMet: 1.489 ± 0.409
1.935GlnAsn: 1.935 ± 0.395
0.968GlnPro: 0.968 ± 0.236
3.052GlnGln: 3.052 ± 0.826
2.233GlnArg: 2.233 ± 0.539
2.68GlnSer: 2.68 ± 0.537
1.861GlnThr: 1.861 ± 0.393
2.829GlnVal: 2.829 ± 0.404
0.744GlnTrp: 0.744 ± 0.257
1.712GlnTyr: 1.712 ± 0.363
0.0GlnXaa: 0.0 ± 0.0
Arg
5.36ArgAla: 5.36 ± 0.98
0.596ArgCys: 0.596 ± 0.249
2.978ArgAsp: 2.978 ± 0.374
4.913ArgGlu: 4.913 ± 0.667
1.638ArgPhe: 1.638 ± 0.306
3.871ArgGly: 3.871 ± 0.693
1.34ArgHis: 1.34 ± 0.324
3.871ArgIle: 3.871 ± 0.729
3.647ArgLys: 3.647 ± 0.581
4.764ArgLeu: 4.764 ± 0.669
2.978ArgMet: 2.978 ± 0.462
2.829ArgAsn: 2.829 ± 0.454
1.787ArgPro: 1.787 ± 0.415
2.233ArgGln: 2.233 ± 0.323
2.456ArgArg: 2.456 ± 0.402
2.308ArgSer: 2.308 ± 0.357
2.68ArgThr: 2.68 ± 0.466
4.243ArgVal: 4.243 ± 0.563
0.819ArgTrp: 0.819 ± 0.265
1.861ArgTyr: 1.861 ± 0.313
0.0ArgXaa: 0.0 ± 0.0
Ser
4.466SerAla: 4.466 ± 0.683
0.447SerCys: 0.447 ± 0.201
3.573SerAsp: 3.573 ± 0.473
3.647SerGlu: 3.647 ± 0.545
2.605SerPhe: 2.605 ± 0.491
4.987SerGly: 4.987 ± 0.816
1.191SerHis: 1.191 ± 0.286
2.605SerIle: 2.605 ± 0.479
2.605SerLys: 2.605 ± 0.437
4.69SerLeu: 4.69 ± 0.679
2.233SerMet: 2.233 ± 0.412
2.754SerAsn: 2.754 ± 0.512
1.861SerPro: 1.861 ± 0.391
3.275SerGln: 3.275 ± 0.518
2.754SerArg: 2.754 ± 0.536
2.829SerSer: 2.829 ± 0.543
2.456SerThr: 2.456 ± 0.405
3.796SerVal: 3.796 ± 0.56
0.968SerTrp: 0.968 ± 0.238
2.456SerTyr: 2.456 ± 0.379
0.0SerXaa: 0.0 ± 0.0
Thr
3.945ThrAla: 3.945 ± 0.64
0.596ThrCys: 0.596 ± 0.233
3.424ThrAsp: 3.424 ± 0.697
3.573ThrGlu: 3.573 ± 0.46
2.382ThrPhe: 2.382 ± 0.448
4.615ThrGly: 4.615 ± 0.532
1.34ThrHis: 1.34 ± 0.367
2.978ThrIle: 2.978 ± 0.448
4.392ThrLys: 4.392 ± 0.379
3.945ThrLeu: 3.945 ± 0.689
0.893ThrMet: 0.893 ± 0.243
1.787ThrAsn: 1.787 ± 0.327
2.903ThrPro: 2.903 ± 0.52
1.935ThrGln: 1.935 ± 0.338
2.605ThrArg: 2.605 ± 0.381
3.275ThrSer: 3.275 ± 0.654
2.456ThrThr: 2.456 ± 0.513
3.35ThrVal: 3.35 ± 0.511
0.819ThrTrp: 0.819 ± 0.27
1.414ThrTyr: 1.414 ± 0.267
0.0ThrXaa: 0.0 ± 0.0
Val
7.295ValAla: 7.295 ± 0.679
0.744ValCys: 0.744 ± 0.266
3.647ValAsp: 3.647 ± 0.452
5.136ValGlu: 5.136 ± 0.573
1.787ValPhe: 1.787 ± 0.395
4.541ValGly: 4.541 ± 0.607
1.563ValHis: 1.563 ± 0.39
3.722ValIle: 3.722 ± 0.486
4.392ValLys: 4.392 ± 0.459
3.945ValLeu: 3.945 ± 0.524
2.978ValMet: 2.978 ± 0.399
3.499ValAsn: 3.499 ± 0.485
3.052ValPro: 3.052 ± 0.399
2.605ValGln: 2.605 ± 0.522
4.838ValArg: 4.838 ± 0.705
4.317ValSer: 4.317 ± 0.688
4.392ValThr: 4.392 ± 0.726
4.69ValVal: 4.69 ± 0.813
0.819ValTrp: 0.819 ± 0.289
1.861ValTyr: 1.861 ± 0.442
0.0ValXaa: 0.0 ± 0.0
Trp
1.117TrpAla: 1.117 ± 0.263
0.447TrpCys: 0.447 ± 0.19
0.893TrpAsp: 0.893 ± 0.251
1.265TrpGlu: 1.265 ± 0.289
0.447TrpPhe: 0.447 ± 0.188
0.893TrpGly: 0.893 ± 0.237
0.372TrpHis: 0.372 ± 0.173
0.298TrpIle: 0.298 ± 0.137
1.191TrpLys: 1.191 ± 0.347
0.893TrpLeu: 0.893 ± 0.238
0.372TrpMet: 0.372 ± 0.176
0.893TrpAsn: 0.893 ± 0.27
1.042TrpPro: 1.042 ± 0.339
0.968TrpGln: 0.968 ± 0.23
0.819TrpArg: 0.819 ± 0.317
0.744TrpSer: 0.744 ± 0.263
0.223TrpThr: 0.223 ± 0.144
1.265TrpVal: 1.265 ± 0.368
0.149TrpTrp: 0.149 ± 0.096
0.298TrpTyr: 0.298 ± 0.125
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.829TyrAla: 2.829 ± 0.336
0.223TyrCys: 0.223 ± 0.114
2.01TyrAsp: 2.01 ± 0.346
1.935TyrGlu: 1.935 ± 0.358
1.042TyrPhe: 1.042 ± 0.305
2.829TyrGly: 2.829 ± 0.368
0.819TyrHis: 0.819 ± 0.27
2.01TyrIle: 2.01 ± 0.25
1.265TyrLys: 1.265 ± 0.353
2.531TyrLeu: 2.531 ± 0.333
1.117TyrMet: 1.117 ± 0.325
2.084TyrAsn: 2.084 ± 0.407
1.117TyrPro: 1.117 ± 0.283
1.191TyrGln: 1.191 ± 0.363
1.787TyrArg: 1.787 ± 0.358
1.489TyrSer: 1.489 ± 0.506
2.233TyrThr: 2.233 ± 0.366
2.01TyrVal: 2.01 ± 0.415
0.298TyrTrp: 0.298 ± 0.196
0.67TyrTyr: 0.67 ± 0.259
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 54 proteins (13435 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski