Amino acid dipeptide frequency for Klebsiella phage KpV71

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
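A minimal sketch of how such a frequency could be computed is given below. It is an illustration only, not the code used by Proteome-pI: the function name `dipeptide_frequencies`, the mapping of every non-standard letter to X (Xaa), and the normalisation by the total number of overlapping pairs are all assumptions.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Hypothetical helper: pairs are counted as overlapping windows of
    length 2 and never span two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not the phage proteome: pairs are MA, AA, AK, AA, AL.
freqs = dipeptide_frequencies(["MAAK", "AAL"])
print(freqs["AA"])  # 400.0 (2 of 5 pairs)
```

With real data, the list would hold the protein sequences of the proteome in one-letter code.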

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins uniformly at random, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
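The arithmetic behind the expected value can be sketched as follows. The threshold is assumed to be the uniform expectation of 0.25% (2.5‰) for 20 amino acids, and the function names are hypothetical; the exact rule used to colour the table is not stated on this page.

```python
def expected_uniform_per_mille(alphabet_size=20):
    """Frequency (in per mille) of each dipeptide if all were equally likely."""
    return 1000.0 / (alphabet_size ** 2)


def classify(observed_per_mille, alphabet_size=20):
    """Compare an observed frequency with the uniform expectation."""
    expected = expected_uniform_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"   # would be marked in red
    if observed_per_mille < expected:
        return "under"  # would be marked in blue
    return "expected"


print(expected_uniform_per_mille())  # 2.5
print(classify(16.089))              # over  (AlaAla)
print(classify(0.517))               # under (AlaCys)
```

Passing `alphabet_size=21` gives the expectation when Xaa is counted as a 21st amino acid (1000/441, roughly 2.27‰).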

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
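As a small worked example of this unit conversion (the helper names are illustrative only):

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value / 1000.0


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


# AlaAla is listed as 16.089 per mille in the table.
ala_ala_fraction = per_mille_to_fraction(16.089)  # about 0.016089
ala_ala_percent = per_mille_to_percent(16.089)    # about 1.6089 %
print(f"{ala_ala_fraction:.6f} {ala_ala_percent:.4f}%")
```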

For more information, see the sequence space article on Wikipedia.

Rows give the first residue and columns the second residue; each cell is frequency ± error in ‰.

First / second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 16.089 ± 1.432 | 0.517 ± 0.222 | 5.978 ± 0.712 | 5.683 ± 0.667 | 3.173 ± 0.449 | 8.192 ± 1.119 | 1.107 ± 0.332 | 4.576 ± 0.696 | 5.314 ± 0.968 | 9.815 ± 0.774 | 3.321 ± 0.404 | 2.804 ± 0.496 | 4.354 ± 0.958 | 4.797 ± 0.76 | 5.609 ± 0.763 | 5.535 ± 0.717 | 5.683 ± 0.777 | 6.863 ± 0.786 | 1.328 ± 0.372 | 4.649 ± 0.537 | 0.0 ± 0.0
Cys | 0.812 ± 0.321 | 0.369 ± 0.213 | 0.517 ± 0.164 | 0.517 ± 0.188 | 0.443 ± 0.198 | 0.664 ± 0.227 | 0.369 ± 0.16 | 0.443 ± 0.201 | 0.59 ± 0.239 | 0.886 ± 0.219 | 0.59 ± 0.194 | 0.443 ± 0.193 | 0.517 ± 0.198 | 0.369 ± 0.165 | 0.812 ± 0.208 | 1.033 ± 0.274 | 0.886 ± 0.251 | 1.107 ± 0.303 | 0.369 ± 0.177 | 0.517 ± 0.164 | 0.0 ± 0.0
Asp | 7.454 ± 0.935 | 0.959 ± 0.248 | 3.173 ± 0.451 | 3.542 ± 0.532 | 2.509 ± 0.426 | 4.354 ± 0.566 | 0.738 ± 0.262 | 3.026 ± 0.494 | 2.657 ± 0.39 | 5.092 ± 0.616 | 2.657 ± 0.397 | 2.657 ± 0.394 | 2.362 ± 0.363 | 1.697 ± 0.339 | 2.214 ± 0.575 | 4.945 ± 0.532 | 3.911 ± 0.545 | 4.354 ± 0.517 | 1.107 ± 0.215 | 2.066 ± 0.335 | 0.0 ± 0.0
Glu | 5.166 ± 0.608 | 0.59 ± 0.261 | 2.952 ± 0.447 | 3.764 ± 0.792 | 2.583 ± 0.416 | 4.133 ± 0.441 | 2.214 ± 0.499 | 2.214 ± 0.397 | 1.919 ± 0.455 | 5.461 ± 0.633 | 2.066 ± 0.314 | 2.288 ± 0.474 | 1.624 ± 0.255 | 3.469 ± 0.661 | 4.059 ± 0.57 | 2.509 ± 0.48 | 3.026 ± 0.453 | 5.387 ± 0.476 | 0.959 ± 0.246 | 2.435 ± 0.319 | 0.0 ± 0.0
Phe | 2.657 ± 0.423 | 0.443 ± 0.223 | 2.14 ± 0.385 | 2.214 ± 0.456 | 1.476 ± 0.292 | 1.919 ± 0.382 | 0.517 ± 0.172 | 1.107 ± 0.239 | 1.993 ± 0.473 | 2.288 ± 0.395 | 0.59 ± 0.228 | 1.402 ± 0.313 | 1.476 ± 0.286 | 1.476 ± 0.268 | 1.771 ± 0.361 | 1.55 ± 0.28 | 2.066 ± 0.515 | 2.214 ± 0.447 | 0.664 ± 0.179 | 1.328 ± 0.282 | 0.0 ± 0.0
Gly | 5.461 ± 0.541 | 1.255 ± 0.309 | 4.354 ± 0.553 | 3.764 ± 0.397 | 2.583 ± 0.462 | 4.28 ± 0.673 | 1.255 ± 0.312 | 4.576 ± 0.666 | 4.354 ± 0.581 | 6.199 ± 0.651 | 1.771 ± 0.529 | 3.026 ± 0.441 | 1.697 ± 0.34 | 2.657 ± 0.399 | 5.166 ± 0.534 | 5.092 ± 0.578 | 5.092 ± 0.643 | 6.125 ± 0.804 | 0.886 ± 0.202 | 3.173 ± 0.478 | 0.0 ± 0.0
His | 1.55 ± 0.394 | 0.221 ± 0.112 | 1.107 ± 0.315 | 1.255 ± 0.41 | 0.221 ± 0.117 | 1.845 ± 0.444 | 0.074 ± 0.06 | 1.107 ± 0.311 | 0.959 ± 0.227 | 2.288 ± 0.414 | 0.664 ± 0.193 | 0.812 ± 0.253 | 0.812 ± 0.302 | 0.369 ± 0.173 | 1.402 ± 0.296 | 0.959 ± 0.244 | 0.886 ± 0.233 | 1.107 ± 0.283 | 0.221 ± 0.136 | 0.664 ± 0.208 | 0.0 ± 0.0
Ile | 3.1 ± 0.454 | 0.59 ± 0.221 | 2.804 ± 0.355 | 2.657 ± 0.507 | 0.59 ± 0.18 | 2.804 ± 0.491 | 0.738 ± 0.235 | 1.845 ± 0.325 | 2.878 ± 0.53 | 4.059 ± 0.678 | 1.402 ± 0.248 | 2.066 ± 0.347 | 2.583 ± 0.47 | 2.435 ± 0.398 | 2.952 ± 0.424 | 3.026 ± 0.422 | 2.731 ± 0.463 | 2.583 ± 0.441 | 0.221 ± 0.141 | 1.181 ± 0.225 | 0.0 ± 0.0
Lys | 6.642 ± 0.846 | 0.517 ± 0.194 | 2.731 ± 0.417 | 3.616 ± 0.49 | 1.476 ± 0.373 | 3.1 ± 0.567 | 0.886 ± 0.251 | 1.402 ± 0.247 | 2.066 ± 0.606 | 4.649 ± 0.726 | 1.402 ± 0.312 | 1.328 ± 0.261 | 1.771 ± 0.46 | 3.247 ± 0.56 | 3.469 ± 0.501 | 2.509 ± 0.347 | 2.657 ± 0.424 | 2.878 ± 0.557 | 1.033 ± 0.234 | 1.845 ± 0.449 | 0.0 ± 0.0
Leu | 8.118 ± 0.86 | 1.181 ± 0.35 | 6.79 ± 0.669 | 5.683 ± 0.568 | 2.804 ± 0.396 | 6.494 ± 0.555 | 1.624 ± 0.253 | 4.059 ± 0.685 | 3.173 ± 0.473 | 6.642 ± 0.706 | 1.919 ± 0.353 | 3.911 ± 0.489 | 3.173 ± 0.486 | 4.059 ± 0.566 | 6.494 ± 0.658 | 4.723 ± 0.614 | 4.871 ± 0.594 | 6.863 ± 0.797 | 1.107 ± 0.292 | 3.026 ± 0.479 | 0.0 ± 0.0
Met | 3.469 ± 0.611 | 0.295 ± 0.155 | 1.919 ± 0.439 | 1.181 ± 0.294 | 0.664 ± 0.241 | 1.402 ± 0.209 | 0.812 ± 0.257 | 0.59 ± 0.174 | 1.255 ± 0.296 | 3.542 ± 0.552 | 0.443 ± 0.252 | 0.959 ± 0.294 | 1.107 ± 0.271 | 2.362 ± 0.499 | 2.14 ± 0.448 | 2.435 ± 0.43 | 0.959 ± 0.338 | 1.993 ± 0.381 | 0.443 ± 0.169 | 1.033 ± 0.251 | 0.0 ± 0.0
Asn | 2.952 ± 0.43 | 0.664 ± 0.346 | 2.362 ± 0.426 | 1.328 ± 0.28 | 0.812 ± 0.263 | 3.395 ± 0.422 | 0.148 ± 0.09 | 2.435 ± 0.423 | 2.214 ± 0.356 | 2.583 ± 0.452 | 1.107 ± 0.31 | 1.476 ± 0.37 | 2.804 ± 0.481 | 1.55 ± 0.339 | 2.14 ± 0.358 | 2.804 ± 0.506 | 3.395 ± 0.516 | 3.026 ± 0.371 | 0.664 ± 0.252 | 1.55 ± 0.345 | 0.0 ± 0.0
Pro | 4.576 ± 0.817 | 0.295 ± 0.135 | 2.731 ± 0.427 | 3.247 ± 0.509 | 0.959 ± 0.235 | 2.657 ± 0.563 | 0.443 ± 0.188 | 1.993 ± 0.339 | 1.993 ± 0.414 | 2.804 ± 0.393 | 0.738 ± 0.196 | 1.402 ± 0.377 | 0.59 ± 0.19 | 1.033 ± 0.219 | 1.771 ± 0.389 | 2.804 ± 0.622 | 2.362 ± 0.412 | 3.321 ± 0.417 | 0.664 ± 0.219 | 1.55 ± 0.339 | 0.0 ± 0.0
Gln | 4.797 ± 0.848 | 0.369 ± 0.166 | 3.173 ± 0.448 | 3.616 ± 0.603 | 1.181 ± 0.307 | 2.804 ± 0.438 | 1.402 ± 0.35 | 1.107 ± 0.293 | 2.435 ± 0.473 | 4.354 ± 0.571 | 1.033 ± 0.238 | 2.214 ± 0.334 | 1.328 ± 0.389 | 2.657 ± 0.617 | 2.878 ± 0.431 | 3.247 ± 0.524 | 1.624 ± 0.423 | 3.026 ± 0.477 | 0.664 ± 0.254 | 1.845 ± 0.394 | 0.0 ± 0.0
Arg | 7.454 ± 1.029 | 0.738 ± 0.242 | 3.321 ± 0.622 | 3.616 ± 0.499 | 2.288 ± 0.27 | 4.207 ± 0.729 | 1.107 ± 0.227 | 3.026 ± 0.444 | 3.247 ± 0.609 | 5.314 ± 0.522 | 1.993 ± 0.422 | 2.878 ± 0.4 | 1.624 ± 0.355 | 2.657 ± 0.465 | 4.649 ± 0.781 | 3.247 ± 0.648 | 3.173 ± 0.422 | 3.69 ± 0.414 | 0.738 ± 0.173 | 2.214 ± 0.298 | 0.0 ± 0.0
Ser | 8.561 ± 0.871 | 0.738 ± 0.229 | 4.28 ± 0.483 | 2.731 ± 0.506 | 1.771 ± 0.315 | 5.092 ± 0.782 | 0.812 ± 0.242 | 2.583 ± 0.485 | 3.395 ± 0.484 | 4.649 ± 0.668 | 2.804 ± 0.368 | 3.469 ± 0.78 | 2.14 ± 0.299 | 1.993 ± 0.359 | 2.804 ± 0.397 | 3.469 ± 0.612 | 4.354 ± 0.637 | 4.059 ± 0.473 | 1.181 ± 0.27 | 1.771 ± 0.347 | 0.0 ± 0.0
Thr | 6.052 ± 0.932 | 0.664 ± 0.278 | 2.804 ± 0.367 | 2.878 ± 0.507 | 2.214 ± 0.428 | 5.314 ± 0.692 | 1.476 ± 0.312 | 2.066 ± 0.479 | 2.509 ± 0.528 | 4.723 ± 0.564 | 1.624 ± 0.361 | 1.55 ± 0.387 | 3.1 ± 0.356 | 2.731 ± 0.402 | 2.804 ± 0.575 | 4.649 ± 0.602 | 3.026 ± 0.435 | 4.502 ± 0.771 | 0.738 ± 0.135 | 2.657 ± 0.367 | 0.0 ± 0.0
Val | 6.79 ± 0.998 | 0.738 ± 0.226 | 4.945 ± 0.534 | 4.133 ± 0.636 | 1.476 ± 0.36 | 6.199 ± 0.774 | 2.066 ± 0.362 | 2.657 ± 0.409 | 3.395 ± 0.531 | 6.125 ± 0.764 | 1.697 ± 0.362 | 2.509 ± 0.552 | 3.247 ± 0.483 | 3.395 ± 0.758 | 4.354 ± 0.527 | 5.092 ± 0.823 | 3.395 ± 0.593 | 6.125 ± 0.549 | 0.812 ± 0.249 | 3.1 ± 0.464 | 0.0 ± 0.0
Trp | 0.959 ± 0.231 | 0.369 ± 0.153 | 0.738 ± 0.192 | 1.328 ± 0.28 | 0.886 ± 0.325 | 0.738 ± 0.292 | 0.221 ± 0.117 | 0.517 ± 0.217 | 0.59 ± 0.223 | 1.402 ± 0.263 | 0.295 ± 0.188 | 0.886 ± 0.249 | 0.369 ± 0.186 | 0.59 ± 0.195 | 1.107 ± 0.281 | 0.59 ± 0.234 | 0.812 ± 0.159 | 1.255 ± 0.276 | 0.369 ± 0.132 | 0.959 ± 0.243 | 0.0 ± 0.0
Tyr | 2.731 ± 0.494 | 0.738 ± 0.237 | 2.657 ± 0.501 | 2.066 ± 0.518 | 1.181 ± 0.261 | 3.173 ± 0.473 | 0.517 ± 0.174 | 1.919 ± 0.348 | 2.288 ± 0.529 | 3.838 ± 0.535 | 0.812 ± 0.267 | 1.328 ± 0.31 | 1.255 ± 0.251 | 2.214 ± 0.382 | 2.583 ± 0.418 | 2.435 ± 0.29 | 3.173 ± 0.442 | 1.845 ± 0.379 | 0.812 ± 0.259 | 1.476 ± 0.37 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 53 proteins (13551 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level
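One way such a protein-level bootstrap could work is sketched below. This is an assumption about the general technique, not the database's actual procedure: whole proteins are resampled with replacement 100 times, the frequency is recomputed for each resample, and the standard deviation of those estimates is reported as the error. The function names and the fixed seed are illustrative.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Per mille frequency of one dipeptide among all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins (not single residues) with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)


# Toy sequences for illustration, not the 53 phage proteins.
toy = ["MAAK", "AAL", "MKV"]
error = bootstrap_error(toy, "AA")
```

Resampling at the protein level keeps each sequence intact, so the estimated error reflects variation between proteins rather than between individual residue pairs.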

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski