Amino acid dipepetide frequency for Klebsiella phage vB_Kp3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
13.351AlaAla: 13.351 ± 1.086
0.847AlaCys: 0.847 ± 0.263
6.578AlaAsp: 6.578 ± 0.552
6.773AlaGlu: 6.773 ± 0.615
2.475AlaPhe: 2.475 ± 0.482
8.271AlaGly: 8.271 ± 0.668
1.433AlaHis: 1.433 ± 0.288
6.643AlaIle: 6.643 ± 1.12
5.275AlaLys: 5.275 ± 0.746
6.122AlaLeu: 6.122 ± 0.686
3.061AlaMet: 3.061 ± 0.576
6.447AlaAsn: 6.447 ± 1.229
3.191AlaPro: 3.191 ± 0.505
4.624AlaGln: 4.624 ± 1.022
4.168AlaArg: 4.168 ± 0.721
6.252AlaSer: 6.252 ± 0.874
5.21AlaThr: 5.21 ± 0.511
6.773AlaVal: 6.773 ± 0.696
1.498AlaTrp: 1.498 ± 0.249
2.41AlaTyr: 2.41 ± 0.46
0.0AlaXaa: 0.0 ± 0.0
Cys
0.586CysAla: 0.586 ± 0.231
0.065CysCys: 0.065 ± 0.067
0.716CysAsp: 0.716 ± 0.243
0.586CysGlu: 0.586 ± 0.255
0.261CysPhe: 0.261 ± 0.15
1.107CysGly: 1.107 ± 0.36
0.326CysHis: 0.326 ± 0.145
0.391CysIle: 0.391 ± 0.183
0.847CysLys: 0.847 ± 0.31
0.586CysLeu: 0.586 ± 0.199
0.13CysMet: 0.13 ± 0.106
0.521CysAsn: 0.521 ± 0.16
0.326CysPro: 0.326 ± 0.131
0.456CysGln: 0.456 ± 0.209
1.172CysArg: 1.172 ± 0.28
0.651CysSer: 0.651 ± 0.199
0.391CysThr: 0.391 ± 0.123
0.716CysVal: 0.716 ± 0.268
0.326CysTrp: 0.326 ± 0.153
0.261CysTyr: 0.261 ± 0.157
0.0CysXaa: 0.0 ± 0.0
Asp
7.229AspAla: 7.229 ± 0.85
0.391AspCys: 0.391 ± 0.174
4.168AspAsp: 4.168 ± 0.661
3.256AspGlu: 3.256 ± 0.548
3.191AspPhe: 3.191 ± 0.505
4.363AspGly: 4.363 ± 0.749
0.847AspHis: 0.847 ± 0.194
3.712AspIle: 3.712 ± 0.532
2.605AspLys: 2.605 ± 0.451
5.601AspLeu: 5.601 ± 0.478
1.758AspMet: 1.758 ± 0.328
2.8AspAsn: 2.8 ± 0.379
2.019AspPro: 2.019 ± 0.4
2.345AspGln: 2.345 ± 0.41
4.103AspArg: 4.103 ± 0.726
3.126AspSer: 3.126 ± 0.446
2.41AspThr: 2.41 ± 0.432
4.559AspVal: 4.559 ± 0.494
1.237AspTrp: 1.237 ± 0.281
2.345AspTyr: 2.345 ± 0.479
0.0AspXaa: 0.0 ± 0.0
Glu
5.926GluAla: 5.926 ± 0.972
0.847GluCys: 0.847 ± 0.29
3.517GluAsp: 3.517 ± 0.67
3.842GluGlu: 3.842 ± 0.956
1.628GluPhe: 1.628 ± 0.429
4.168GluGly: 4.168 ± 0.551
0.977GluHis: 0.977 ± 0.329
3.256GluIle: 3.256 ± 0.476
3.517GluLys: 3.517 ± 0.94
4.624GluLeu: 4.624 ± 0.754
1.889GluMet: 1.889 ± 0.517
1.758GluAsn: 1.758 ± 0.433
2.279GluPro: 2.279 ± 0.588
2.931GluGln: 2.931 ± 0.514
3.777GluArg: 3.777 ± 0.742
2.475GluSer: 2.475 ± 0.438
2.8GluThr: 2.8 ± 0.409
3.517GluVal: 3.517 ± 0.574
1.042GluTrp: 1.042 ± 0.299
2.084GluTyr: 2.084 ± 0.26
0.0GluXaa: 0.0 ± 0.0
Phe
2.67PheAla: 2.67 ± 0.553
0.521PheCys: 0.521 ± 0.222
2.605PheAsp: 2.605 ± 0.419
1.824PheGlu: 1.824 ± 0.339
0.847PhePhe: 0.847 ± 0.31
2.931PheGly: 2.931 ± 0.396
0.651PheHis: 0.651 ± 0.266
1.954PheIle: 1.954 ± 0.433
2.149PheLys: 2.149 ± 0.44
2.279PheLeu: 2.279 ± 0.497
1.107PheMet: 1.107 ± 0.246
2.41PheAsn: 2.41 ± 0.361
0.977PhePro: 0.977 ± 0.299
1.172PheGln: 1.172 ± 0.232
2.345PheArg: 2.345 ± 0.399
2.149PheSer: 2.149 ± 0.352
2.149PheThr: 2.149 ± 0.332
2.345PheVal: 2.345 ± 0.411
0.847PheTrp: 0.847 ± 0.342
1.107PheTyr: 1.107 ± 0.305
0.0PheXaa: 0.0 ± 0.0
Gly
5.666GlyAla: 5.666 ± 0.66
0.782GlyCys: 0.782 ± 0.277
4.038GlyAsp: 4.038 ± 0.43
4.298GlyGlu: 4.298 ± 0.569
2.605GlyPhe: 2.605 ± 0.41
6.903GlyGly: 6.903 ± 0.704
0.912GlyHis: 0.912 ± 0.303
5.275GlyIle: 5.275 ± 0.434
4.95GlyLys: 4.95 ± 0.572
5.861GlyLeu: 5.861 ± 0.667
1.954GlyMet: 1.954 ± 0.358
4.298GlyAsn: 4.298 ± 0.507
1.563GlyPro: 1.563 ± 0.333
4.559GlyGln: 4.559 ± 0.57
5.471GlyArg: 5.471 ± 0.541
4.754GlySer: 4.754 ± 1.231
5.601GlyThr: 5.601 ± 1.195
5.731GlyVal: 5.731 ± 0.627
1.172GlyTrp: 1.172 ± 0.428
2.735GlyTyr: 2.735 ± 0.348
0.0GlyXaa: 0.0 ± 0.0
His
1.303HisAla: 1.303 ± 0.336
0.456HisCys: 0.456 ± 0.189
0.782HisAsp: 0.782 ± 0.296
0.847HisGlu: 0.847 ± 0.309
0.391HisPhe: 0.391 ± 0.193
0.977HisGly: 0.977 ± 0.255
0.391HisHis: 0.391 ± 0.169
0.847HisIle: 0.847 ± 0.31
0.521HisLys: 0.521 ± 0.23
1.237HisLeu: 1.237 ± 0.435
0.586HisMet: 0.586 ± 0.246
0.716HisAsn: 0.716 ± 0.246
0.586HisPro: 0.586 ± 0.301
0.13HisGln: 0.13 ± 0.101
0.912HisArg: 0.912 ± 0.266
0.586HisSer: 0.586 ± 0.168
0.912HisThr: 0.912 ± 0.244
1.563HisVal: 1.563 ± 0.306
0.195HisTrp: 0.195 ± 0.133
0.782HisTyr: 0.782 ± 0.242
0.0HisXaa: 0.0 ± 0.0
Ile
5.666IleAla: 5.666 ± 0.683
0.651IleCys: 0.651 ± 0.229
4.494IleAsp: 4.494 ± 0.488
3.321IleGlu: 3.321 ± 0.69
1.303IlePhe: 1.303 ± 0.292
4.559IleGly: 4.559 ± 0.534
0.912IleHis: 0.912 ± 0.383
3.061IleIle: 3.061 ± 0.695
3.256IleLys: 3.256 ± 0.422
3.452IleLeu: 3.452 ± 0.677
1.368IleMet: 1.368 ± 0.444
3.126IleAsn: 3.126 ± 0.387
3.452IlePro: 3.452 ± 0.516
2.084IleGln: 2.084 ± 0.353
2.735IleArg: 2.735 ± 0.331
4.038IleSer: 4.038 ± 0.548
5.926IleThr: 5.926 ± 1.421
3.582IleVal: 3.582 ± 0.438
0.586IleTrp: 0.586 ± 0.172
1.889IleTyr: 1.889 ± 0.403
0.0IleXaa: 0.0 ± 0.0
Lys
5.405LysAla: 5.405 ± 0.548
0.521LysCys: 0.521 ± 0.172
2.931LysAsp: 2.931 ± 0.397
3.777LysGlu: 3.777 ± 0.922
2.345LysPhe: 2.345 ± 0.448
4.559LysGly: 4.559 ± 0.577
1.237LysHis: 1.237 ± 0.436
3.061LysIle: 3.061 ± 0.54
2.866LysLys: 2.866 ± 0.657
3.777LysLeu: 3.777 ± 0.643
1.954LysMet: 1.954 ± 0.516
2.019LysAsn: 2.019 ± 0.448
2.084LysPro: 2.084 ± 0.476
2.605LysGln: 2.605 ± 0.48
3.582LysArg: 3.582 ± 0.625
3.712LysSer: 3.712 ± 0.44
2.931LysThr: 2.931 ± 0.428
2.866LysVal: 2.866 ± 0.43
0.651LysTrp: 0.651 ± 0.223
1.303LysTyr: 1.303 ± 0.327
0.0LysXaa: 0.0 ± 0.0
Leu
7.555LeuAla: 7.555 ± 0.744
0.456LeuCys: 0.456 ± 0.223
4.103LeuAsp: 4.103 ± 0.848
3.387LeuGlu: 3.387 ± 0.632
2.214LeuPhe: 2.214 ± 0.511
4.363LeuGly: 4.363 ± 0.885
0.716LeuHis: 0.716 ± 0.218
3.712LeuIle: 3.712 ± 0.576
4.168LeuLys: 4.168 ± 0.778
4.559LeuLeu: 4.559 ± 0.629
1.824LeuMet: 1.824 ± 0.411
3.842LeuAsn: 3.842 ± 0.554
2.149LeuPro: 2.149 ± 0.56
2.475LeuGln: 2.475 ± 0.389
4.559LeuArg: 4.559 ± 0.696
4.95LeuSer: 4.95 ± 0.499
5.601LeuThr: 5.601 ± 1.446
5.34LeuVal: 5.34 ± 0.7
0.977LeuTrp: 0.977 ± 0.286
2.149LeuTyr: 2.149 ± 0.415
0.0LeuXaa: 0.0 ± 0.0
Met
3.061MetAla: 3.061 ± 0.464
0.391MetCys: 0.391 ± 0.134
1.368MetAsp: 1.368 ± 0.302
1.303MetGlu: 1.303 ± 0.377
0.716MetPhe: 0.716 ± 0.214
1.693MetGly: 1.693 ± 0.459
0.521MetHis: 0.521 ± 0.239
1.889MetIle: 1.889 ± 0.432
2.279MetLys: 2.279 ± 0.515
1.758MetLeu: 1.758 ± 0.35
0.782MetMet: 0.782 ± 0.265
1.758MetAsn: 1.758 ± 0.362
1.042MetPro: 1.042 ± 0.27
1.237MetGln: 1.237 ± 0.247
1.628MetArg: 1.628 ± 0.267
1.954MetSer: 1.954 ± 0.423
2.41MetThr: 2.41 ± 0.454
1.824MetVal: 1.824 ± 0.336
0.13MetTrp: 0.13 ± 0.1
0.456MetTyr: 0.456 ± 0.188
0.0MetXaa: 0.0 ± 0.0
Asn
5.926AsnAla: 5.926 ± 1.031
0.716AsnCys: 0.716 ± 0.203
2.54AsnAsp: 2.54 ± 0.315
2.54AsnGlu: 2.54 ± 0.516
1.758AsnPhe: 1.758 ± 0.347
5.275AsnGly: 5.275 ± 1.041
0.586AsnHis: 0.586 ± 0.244
2.605AsnIle: 2.605 ± 0.435
2.8AsnLys: 2.8 ± 0.367
2.735AsnLeu: 2.735 ± 0.521
1.107AsnMet: 1.107 ± 0.245
2.735AsnAsn: 2.735 ± 0.757
2.54AsnPro: 2.54 ± 0.458
1.889AsnGln: 1.889 ± 0.354
2.345AsnArg: 2.345 ± 0.356
3.452AsnSer: 3.452 ± 0.773
2.866AsnThr: 2.866 ± 0.498
2.67AsnVal: 2.67 ± 0.378
0.651AsnTrp: 0.651 ± 0.211
1.563AsnTyr: 1.563 ± 0.34
0.0AsnXaa: 0.0 ± 0.0
Pro
3.908ProAla: 3.908 ± 0.473
0.261ProCys: 0.261 ± 0.12
2.41ProAsp: 2.41 ± 0.479
2.54ProGlu: 2.54 ± 0.665
1.758ProPhe: 1.758 ± 0.301
2.866ProGly: 2.866 ± 0.498
0.521ProHis: 0.521 ± 0.246
1.433ProIle: 1.433 ± 0.297
2.084ProLys: 2.084 ± 0.327
2.019ProLeu: 2.019 ± 0.411
0.782ProMet: 0.782 ± 0.314
1.303ProAsn: 1.303 ± 0.297
1.693ProPro: 1.693 ± 0.412
1.563ProGln: 1.563 ± 0.332
1.498ProArg: 1.498 ± 0.408
2.084ProSer: 2.084 ± 0.392
2.605ProThr: 2.605 ± 0.486
3.256ProVal: 3.256 ± 0.645
0.782ProTrp: 0.782 ± 0.286
1.303ProTyr: 1.303 ± 0.417
0.0ProXaa: 0.0 ± 0.0
Gln
5.731GlnAla: 5.731 ± 0.766
0.521GlnCys: 0.521 ± 0.228
2.345GlnAsp: 2.345 ± 0.353
2.149GlnGlu: 2.149 ± 0.508
1.563GlnPhe: 1.563 ± 0.381
3.387GlnGly: 3.387 ± 0.709
0.782GlnHis: 0.782 ± 0.206
3.256GlnIle: 3.256 ± 0.442
1.498GlnLys: 1.498 ± 0.278
4.689GlnLeu: 4.689 ± 0.558
1.563GlnMet: 1.563 ± 0.36
1.954GlnAsn: 1.954 ± 0.473
1.303GlnPro: 1.303 ± 0.323
3.842GlnGln: 3.842 ± 0.638
2.605GlnArg: 2.605 ± 0.43
2.084GlnSer: 2.084 ± 0.543
2.279GlnThr: 2.279 ± 0.416
2.54GlnVal: 2.54 ± 0.435
0.912GlnTrp: 0.912 ± 0.209
0.912GlnTyr: 0.912 ± 0.225
0.0GlnXaa: 0.0 ± 0.0
Arg
4.624ArgAla: 4.624 ± 1.074
0.912ArgCys: 0.912 ± 0.315
3.582ArgAsp: 3.582 ± 0.655
3.452ArgGlu: 3.452 ± 0.841
2.149ArgPhe: 2.149 ± 0.377
4.038ArgGly: 4.038 ± 0.64
1.303ArgHis: 1.303 ± 0.348
4.168ArgIle: 4.168 ± 0.384
2.735ArgLys: 2.735 ± 0.619
4.429ArgLeu: 4.429 ± 0.598
1.889ArgMet: 1.889 ± 0.324
2.475ArgAsn: 2.475 ± 0.467
1.758ArgPro: 1.758 ± 0.443
2.735ArgGln: 2.735 ± 0.617
3.777ArgArg: 3.777 ± 0.597
2.931ArgSer: 2.931 ± 0.434
2.735ArgThr: 2.735 ± 0.43
4.754ArgVal: 4.754 ± 0.914
0.977ArgTrp: 0.977 ± 0.305
2.149ArgTyr: 2.149 ± 0.452
0.0ArgXaa: 0.0 ± 0.0
Ser
7.229SerAla: 7.229 ± 1.032
0.456SerCys: 0.456 ± 0.234
3.517SerAsp: 3.517 ± 0.616
3.061SerGlu: 3.061 ± 0.412
2.996SerPhe: 2.996 ± 0.432
5.471SerGly: 5.471 ± 1.181
0.586SerHis: 0.586 ± 0.192
4.038SerIle: 4.038 ± 0.67
3.387SerLys: 3.387 ± 0.625
3.842SerLeu: 3.842 ± 0.687
1.563SerMet: 1.563 ± 0.335
2.931SerAsn: 2.931 ± 0.886
2.084SerPro: 2.084 ± 0.405
3.321SerGln: 3.321 ± 1.078
2.735SerArg: 2.735 ± 0.458
3.582SerSer: 3.582 ± 0.744
4.363SerThr: 4.363 ± 0.858
2.996SerVal: 2.996 ± 0.385
1.107SerTrp: 1.107 ± 0.226
1.693SerTyr: 1.693 ± 0.427
0.0SerXaa: 0.0 ± 0.0
Thr
7.034ThrAla: 7.034 ± 1.408
0.391ThrCys: 0.391 ± 0.208
4.819ThrAsp: 4.819 ± 0.674
2.8ThrGlu: 2.8 ± 0.332
2.475ThrPhe: 2.475 ± 0.383
5.405ThrGly: 5.405 ± 0.806
0.651ThrHis: 0.651 ± 0.264
4.168ThrIle: 4.168 ± 0.508
3.061ThrLys: 3.061 ± 0.459
4.689ThrLeu: 4.689 ± 0.52
1.824ThrMet: 1.824 ± 0.359
2.67ThrAsn: 2.67 ± 0.889
2.996ThrPro: 2.996 ± 0.51
3.126ThrGln: 3.126 ± 0.674
2.866ThrArg: 2.866 ± 0.555
4.168ThrSer: 4.168 ± 1.366
5.08ThrThr: 5.08 ± 1.038
4.298ThrVal: 4.298 ± 0.838
0.716ThrTrp: 0.716 ± 0.241
2.345ThrTyr: 2.345 ± 0.324
0.0ThrXaa: 0.0 ± 0.0
Val
5.21ValAla: 5.21 ± 0.445
0.651ValCys: 0.651 ± 0.246
4.819ValAsp: 4.819 ± 0.619
4.819ValGlu: 4.819 ± 0.542
2.279ValPhe: 2.279 ± 0.408
5.08ValGly: 5.08 ± 0.784
0.521ValHis: 0.521 ± 0.213
3.777ValIle: 3.777 ± 0.53
3.908ValLys: 3.908 ± 0.551
3.842ValLeu: 3.842 ± 0.519
1.889ValMet: 1.889 ± 0.515
3.517ValAsn: 3.517 ± 0.453
2.149ValPro: 2.149 ± 0.385
2.8ValGln: 2.8 ± 0.523
3.908ValArg: 3.908 ± 0.468
5.015ValSer: 5.015 ± 0.502
5.21ValThr: 5.21 ± 1.149
4.689ValVal: 4.689 ± 0.718
0.782ValTrp: 0.782 ± 0.242
2.279ValTyr: 2.279 ± 0.468
0.0ValXaa: 0.0 ± 0.0
Trp
1.303TrpAla: 1.303 ± 0.348
0.261TrpCys: 0.261 ± 0.154
0.391TrpAsp: 0.391 ± 0.19
0.847TrpGlu: 0.847 ± 0.29
0.716TrpPhe: 0.716 ± 0.255
0.716TrpGly: 0.716 ± 0.243
0.521TrpHis: 0.521 ± 0.252
0.847TrpIle: 0.847 ± 0.292
0.651TrpLys: 0.651 ± 0.246
1.172TrpLeu: 1.172 ± 0.38
0.326TrpMet: 0.326 ± 0.146
0.716TrpAsn: 0.716 ± 0.235
0.456TrpPro: 0.456 ± 0.176
0.716TrpGln: 0.716 ± 0.21
1.368TrpArg: 1.368 ± 0.288
0.977TrpSer: 0.977 ± 0.345
1.563TrpThr: 1.563 ± 0.353
1.107TrpVal: 1.107 ± 0.304
0.195TrpTrp: 0.195 ± 0.126
0.521TrpTyr: 0.521 ± 0.164
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.019TyrAla: 2.019 ± 0.305
0.326TyrCys: 0.326 ± 0.162
2.345TyrAsp: 2.345 ± 0.32
1.433TyrGlu: 1.433 ± 0.275
1.368TyrPhe: 1.368 ± 0.278
2.996TyrGly: 2.996 ± 0.424
0.326TyrHis: 0.326 ± 0.197
1.368TyrIle: 1.368 ± 0.279
1.693TyrLys: 1.693 ± 0.322
1.824TyrLeu: 1.824 ± 0.451
0.782TyrMet: 0.782 ± 0.186
1.303TyrAsn: 1.303 ± 0.304
2.019TyrPro: 2.019 ± 0.415
1.303TyrGln: 1.303 ± 0.281
2.019TyrArg: 2.019 ± 0.358
2.019TyrSer: 2.019 ± 0.505
2.54TyrThr: 2.54 ± 0.451
2.019TyrVal: 2.019 ± 0.344
0.586TyrTrp: 0.586 ± 0.176
0.651TyrTyr: 0.651 ± 0.179
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 59 proteins (15356 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski