Amino acid dipeptide frequency for Klebsiella virus KpV2811

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and all underrepresented ones in blue, in the table.
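The calculation described above can be sketched as follows. This is a minimal illustration, not the code used by the database: the function names `dipeptide_permille` and `label` are hypothetical, and pooling the counts over all proteins before normalising is an assumption, since the page does not state how the frequencies are normalised.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids as one-letter codes (alphabetical order).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation over 400 dipeptides: 2.5 per mille, i.e. 0.25 %.
UNIFORM_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs.

    Assumption: counts are pooled over all proteins, then normalised.
    """
    counts = Counter()
    for seq in proteins:
        # Anything outside the 20 standard residues is mapped to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping adjacent pairs; a pair never spans two proteins.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    alphabet = STANDARD + "X"
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(alphabet, repeat=2)
    }


def label(freq_permille, expected=UNIFORM_PERMILLE):
    """Classify a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"
```

With the values from the table, `label(12.807)` (AlaAla) gives "over" and `label(0.762)` (AlaCys) gives "under". The alphabetical order of one-letter codes in `STANDARD` happens to match the column order used in the table (Ala, Cys, Asp, Glu, ...).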

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
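As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and a percentage. The helper names are hypothetical and only illustrate the arithmetic.

```python
def permille_to_fraction(value_permille):
    """Convert per mille to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert per mille to percent (1 % = 10 per mille)."""
    return value_permille / 10.0
```

For example, the AlaAla value of 12.807‰ corresponds to a fraction of about 0.012807, or about 1.28%, and the uniform expectation of 2.5‰ corresponds to 0.25%.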

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 12.807 ± 2.155
AlaCys: 0.762 ± 0.275
AlaAsp: 5.538 ± 0.8
AlaGlu: 6.369 ± 0.867
AlaPhe: 2.008 ± 0.31
AlaGly: 7.546 ± 0.897
AlaHis: 0.9 ± 0.235
AlaIle: 7.269 ± 0.692
AlaLys: 5.054 ± 0.672
AlaLeu: 7.2 ± 0.992
AlaMet: 3.6 ± 0.506
AlaAsn: 4.569 ± 0.768
AlaPro: 2.631 ± 0.52
AlaGln: 3.669 ± 0.919
AlaArg: 6.3 ± 0.881
AlaSer: 6.715 ± 0.976
AlaThr: 5.261 ± 0.875
AlaVal: 6.092 ± 0.781
AlaTrp: 1.523 ± 0.324
AlaTyr: 2.7 ± 0.492
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.831 ± 0.206
CysCys: 0.277 ± 0.145
CysAsp: 0.623 ± 0.176
CysGlu: 0.762 ± 0.294
CysPhe: 0.485 ± 0.225
CysGly: 1.523 ± 0.405
CysHis: 0.415 ± 0.171
CysIle: 0.346 ± 0.171
CysLys: 0.485 ± 0.165
CysLeu: 0.9 ± 0.326
CysMet: 0.208 ± 0.118
CysAsn: 0.969 ± 0.307
CysPro: 0.692 ± 0.219
CysGln: 0.415 ± 0.148
CysArg: 1.108 ± 0.3
CysSer: 0.415 ± 0.141
CysThr: 0.554 ± 0.196
CysVal: 0.692 ± 0.236
CysTrp: 0.485 ± 0.161
CysTyr: 0.346 ± 0.161
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.577 ± 0.578
AspCys: 0.831 ± 0.232
AspAsp: 3.323 ± 0.716
AspGlu: 4.154 ± 0.551
AspPhe: 2.492 ± 0.444
AspGly: 5.469 ± 0.652
AspHis: 1.315 ± 0.396
AspIle: 3.808 ± 0.559
AspLys: 4.292 ± 0.586
AspLeu: 3.6 ± 0.46
AspMet: 1.8 ± 0.394
AspAsn: 1.938 ± 0.407
AspPro: 1.385 ± 0.281
AspGln: 1.8 ± 0.366
AspArg: 2.354 ± 0.496
AspSer: 3.115 ± 0.392
AspThr: 2.631 ± 0.358
AspVal: 4.154 ± 0.578
AspTrp: 0.623 ± 0.183
AspTyr: 2.631 ± 0.502
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.646 ± 0.902
GluCys: 1.108 ± 0.315
GluAsp: 3.254 ± 0.499
GluGlu: 4.777 ± 0.612
GluPhe: 1.8 ± 0.349
GluGly: 2.908 ± 0.56
GluHis: 1.108 ± 0.315
GluIle: 4.361 ± 0.769
GluLys: 4.638 ± 0.752
GluLeu: 5.746 ± 0.712
GluMet: 2.354 ± 0.422
GluAsn: 2.838 ± 0.523
GluPro: 2.285 ± 0.458
GluGln: 3.461 ± 0.508
GluArg: 3.946 ± 0.463
GluSer: 2.977 ± 0.53
GluThr: 2.631 ± 0.485
GluVal: 3.877 ± 0.466
GluTrp: 1.523 ± 0.268
GluTyr: 2.7 ± 0.402
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.285 ± 0.502
PheCys: 0.485 ± 0.186
PheAsp: 2.769 ± 0.469
PheGlu: 1.592 ± 0.354
PhePhe: 1.385 ± 0.295
PheGly: 3.184 ± 0.399
PheHis: 0.485 ± 0.197
PheIle: 1.8 ± 0.406
PheLys: 1.938 ± 0.428
PheLeu: 1.731 ± 0.367
PheMet: 1.385 ± 0.308
PheAsn: 1.523 ± 0.29
PhePro: 0.831 ± 0.187
PheGln: 0.623 ± 0.168
PheArg: 1.8 ± 0.345
PheSer: 2.354 ± 0.383
PheThr: 1.8 ± 0.398
PheVal: 1.731 ± 0.398
PheTrp: 0.485 ± 0.15
PheTyr: 1.108 ± 0.239
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.538 ± 0.785
GlyCys: 0.9 ± 0.266
GlyAsp: 3.669 ± 0.538
GlyGlu: 4.915 ± 0.58
GlyPhe: 2.769 ± 0.442
GlyGly: 5.4 ± 0.561
GlyHis: 0.692 ± 0.215
GlyIle: 3.808 ± 0.455
GlyLys: 5.954 ± 0.68
GlyLeu: 4.5 ± 0.532
GlyMet: 2.215 ± 0.356
GlyAsn: 3.531 ± 0.493
GlyPro: 1.246 ± 0.273
GlyGln: 3.115 ± 0.582
GlyArg: 3.738 ± 0.482
GlySer: 4.708 ± 0.599
GlyThr: 4.154 ± 0.902
GlyVal: 6.369 ± 0.602
GlyTrp: 1.523 ± 0.338
GlyTyr: 3.115 ± 0.494
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.592 ± 0.397
HisCys: 0.554 ± 0.21
HisAsp: 1.246 ± 0.318
HisGlu: 1.177 ± 0.326
HisPhe: 0.831 ± 0.271
HisGly: 1.731 ± 0.421
HisHis: 0.346 ± 0.147
HisIle: 1.108 ± 0.359
HisLys: 1.038 ± 0.271
HisLeu: 1.246 ± 0.285
HisMet: 0.554 ± 0.178
HisAsn: 0.762 ± 0.255
HisPro: 0.762 ± 0.229
HisGln: 0.9 ± 0.206
HisArg: 0.9 ± 0.268
HisSer: 0.692 ± 0.209
HisThr: 0.346 ± 0.118
HisVal: 0.969 ± 0.253
HisTrp: 0.138 ± 0.086
HisTyr: 0.415 ± 0.158
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.3 ± 0.699
IleCys: 0.692 ± 0.267
IleAsp: 3.877 ± 0.544
IleGlu: 4.431 ± 0.411
IlePhe: 1.869 ± 0.313
IleGly: 3.877 ± 0.46
IleHis: 1.108 ± 0.233
IleIle: 5.123 ± 0.581
IleLys: 4.154 ± 0.557
IleLeu: 3.808 ± 0.514
IleMet: 1.731 ± 0.383
IleAsn: 3.184 ± 0.355
IlePro: 2.769 ± 0.451
IleGln: 2.215 ± 0.418
IleArg: 3.531 ± 0.408
IleSer: 3.6 ± 0.517
IleThr: 3.669 ± 0.544
IleVal: 3.738 ± 0.416
IleTrp: 0.692 ± 0.249
IleTyr: 2.354 ± 0.343
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.615 ± 0.798
LysCys: 0.762 ± 0.248
LysAsp: 3.184 ± 0.5
LysGlu: 3.877 ± 0.718
LysPhe: 2.146 ± 0.383
LysGly: 3.046 ± 0.559
LysHis: 1.315 ± 0.376
LysIle: 3.738 ± 0.756
LysLys: 3.6 ± 0.605
LysLeu: 4.846 ± 0.614
LysMet: 2.561 ± 0.413
LysAsn: 2.561 ± 0.394
LysPro: 2.631 ± 0.376
LysGln: 2.285 ± 0.308
LysArg: 3.531 ± 0.664
LysSer: 4.015 ± 0.465
LysThr: 3.323 ± 0.494
LysVal: 3.669 ± 0.467
LysTrp: 1.038 ± 0.292
LysTyr: 2.908 ± 0.447
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.438 ± 1.011
LeuCys: 0.762 ± 0.278
LeuAsp: 3.877 ± 0.546
LeuGlu: 4.708 ± 0.553
LeuPhe: 2.492 ± 0.532
LeuGly: 4.154 ± 0.614
LeuHis: 1.038 ± 0.287
LeuIle: 4.5 ± 0.463
LeuLys: 5.331 ± 0.656
LeuLeu: 4.915 ± 0.587
LeuMet: 1.523 ± 0.265
LeuAsn: 4.015 ± 0.487
LeuPro: 2.561 ± 0.485
LeuGln: 3.046 ± 0.483
LeuArg: 4.708 ± 0.472
LeuSer: 5.815 ± 0.62
LeuThr: 5.261 ± 0.586
LeuVal: 3.6 ± 0.601
LeuTrp: 1.177 ± 0.271
LeuTyr: 3.115 ± 0.505
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.631 ± 0.446
MetCys: 0.208 ± 0.126
MetAsp: 1.523 ± 0.37
MetGlu: 1.592 ± 0.287
MetPhe: 0.762 ± 0.231
MetGly: 1.523 ± 0.367
MetHis: 0.692 ± 0.245
MetIle: 2.215 ± 0.363
MetLys: 2.631 ± 0.459
MetLeu: 2.492 ± 0.394
MetMet: 0.623 ± 0.289
MetAsn: 1.315 ± 0.247
MetPro: 1.938 ± 0.379
MetGln: 1.385 ± 0.285
MetArg: 1.938 ± 0.385
MetSer: 1.592 ± 0.3
MetThr: 2.285 ± 0.373
MetVal: 1.8 ± 0.351
MetTrp: 0.415 ± 0.154
MetTyr: 0.831 ± 0.214
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.431 ± 0.8
AsnCys: 0.415 ± 0.186
AsnAsp: 2.492 ± 0.447
AsnGlu: 2.354 ± 0.356
AsnPhe: 0.762 ± 0.252
AsnGly: 3.738 ± 0.546
AsnHis: 1.108 ± 0.28
AsnIle: 2.838 ± 0.427
AsnLys: 2.977 ± 0.521
AsnLeu: 2.908 ± 0.414
AsnMet: 1.177 ± 0.293
AsnAsn: 2.561 ± 0.56
AsnPro: 2.354 ± 0.394
AsnGln: 2.008 ± 0.409
AsnArg: 2.977 ± 0.495
AsnSer: 2.7 ± 0.4
AsnThr: 2.354 ± 0.509
AsnVal: 3.808 ± 0.576
AsnTrp: 0.623 ± 0.185
AsnTyr: 1.661 ± 0.304
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.254 ± 0.584
ProCys: 0.692 ± 0.219
ProAsp: 2.492 ± 0.455
ProGlu: 2.7 ± 0.525
ProPhe: 0.9 ± 0.255
ProGly: 2.908 ± 0.653
ProHis: 0.9 ± 0.205
ProIle: 1.869 ± 0.376
ProLys: 2.146 ± 0.377
ProLeu: 2.354 ± 0.477
ProMet: 0.969 ± 0.267
ProAsn: 0.9 ± 0.201
ProPro: 1.108 ± 0.252
ProGln: 1.108 ± 0.298
ProArg: 0.831 ± 0.238
ProSer: 2.423 ± 0.503
ProThr: 2.077 ± 0.399
ProVal: 3.808 ± 0.612
ProTrp: 0.415 ± 0.2
ProTyr: 1.315 ± 0.28
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.392 ± 1.07
GlnCys: 0.485 ± 0.213
GlnAsp: 1.454 ± 0.359
GlnGlu: 2.492 ± 0.379
GlnPhe: 0.969 ± 0.32
GlnGly: 1.661 ± 0.449
GlnHis: 0.831 ± 0.236
GlnIle: 2.146 ± 0.31
GlnLys: 2.354 ± 0.335
GlnLeu: 4.361 ± 0.664
GlnMet: 1.523 ± 0.327
GlnAsn: 1.731 ± 0.43
GlnPro: 1.592 ± 0.342
GlnGln: 2.977 ± 0.825
GlnArg: 2.838 ± 0.433
GlnSer: 3.392 ± 0.527
GlnThr: 2.354 ± 0.444
GlnVal: 2.561 ± 0.347
GlnTrp: 0.692 ± 0.192
GlnTyr: 1.454 ± 0.284
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.984 ± 0.602
ArgCys: 0.9 ± 0.297
ArgAsp: 2.908 ± 0.425
ArgGlu: 4.084 ± 0.529
ArgPhe: 1.731 ± 0.386
ArgGly: 3.946 ± 0.461
ArgHis: 1.038 ± 0.268
ArgIle: 3.392 ± 0.425
ArgLys: 3.461 ± 0.59
ArgLeu: 4.638 ± 0.589
ArgMet: 2.008 ± 0.377
ArgAsn: 2.354 ± 0.47
ArgPro: 1.869 ± 0.411
ArgGln: 2.285 ± 0.409
ArgArg: 3.392 ± 0.546
ArgSer: 2.561 ± 0.401
ArgThr: 3.115 ± 0.391
ArgVal: 3.115 ± 0.477
ArgTrp: 0.554 ± 0.189
ArgTyr: 2.838 ± 0.426
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.231 ± 0.774
SerCys: 0.554 ± 0.197
SerAsp: 3.946 ± 0.563
SerGlu: 3.6 ± 0.439
SerPhe: 2.077 ± 0.311
SerGly: 6.231 ± 0.757
SerHis: 1.315 ± 0.312
SerIle: 3.669 ± 0.52
SerLys: 2.908 ± 0.516
SerLeu: 5.054 ± 0.619
SerMet: 2.7 ± 0.512
SerAsn: 2.769 ± 0.414
SerPro: 1.731 ± 0.355
SerGln: 2.908 ± 0.422
SerArg: 2.492 ± 0.49
SerSer: 3.461 ± 0.409
SerThr: 3.6 ± 0.655
SerVal: 4.084 ± 0.725
SerTrp: 1.315 ± 0.364
SerTyr: 2.215 ± 0.34
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.915 ± 0.786
ThrCys: 0.554 ± 0.176
ThrAsp: 3.946 ± 0.355
ThrGlu: 3.738 ± 0.543
ThrPhe: 1.385 ± 0.316
ThrGly: 5.192 ± 0.729
ThrHis: 0.9 ± 0.302
ThrIle: 3.738 ± 0.491
ThrLys: 2.7 ± 0.436
ThrLeu: 4.5 ± 0.709
ThrMet: 0.969 ± 0.226
ThrAsn: 2.492 ± 0.467
ThrPro: 2.769 ± 0.512
ThrGln: 2.423 ± 0.334
ThrArg: 3.115 ± 0.397
ThrSer: 4.5 ± 0.77
ThrThr: 3.323 ± 0.603
ThrVal: 4.015 ± 0.794
ThrTrp: 0.762 ± 0.294
ThrTyr: 1.592 ± 0.386
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.269 ± 0.947
ValCys: 0.831 ± 0.254
ValAsp: 3.6 ± 0.536
ValGlu: 4.915 ± 0.486
ValPhe: 2.354 ± 0.445
ValGly: 4.569 ± 0.437
ValHis: 1.038 ± 0.228
ValIle: 4.777 ± 0.557
ValLys: 4.292 ± 0.646
ValLeu: 3.808 ± 0.518
ValMet: 1.523 ± 0.309
ValAsn: 3.877 ± 0.481
ValPro: 1.938 ± 0.381
ValGln: 2.008 ± 0.346
ValArg: 3.046 ± 0.474
ValSer: 4.638 ± 0.579
ValThr: 5.054 ± 0.682
ValVal: 4.292 ± 0.551
ValTrp: 0.831 ± 0.27
ValTyr: 1.523 ± 0.284
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.9 ± 0.28
TrpCys: 0.346 ± 0.157
TrpAsp: 1.523 ± 0.298
TrpGlu: 0.969 ± 0.263
TrpPhe: 0.554 ± 0.171
TrpGly: 0.346 ± 0.136
TrpHis: 0.277 ± 0.144
TrpIle: 0.692 ± 0.238
TrpLys: 0.969 ± 0.241
TrpLeu: 1.731 ± 0.391
TrpMet: 0.208 ± 0.121
TrpAsn: 0.692 ± 0.214
TrpPro: 0.692 ± 0.156
TrpGln: 0.969 ± 0.277
TrpArg: 1.108 ± 0.275
TrpSer: 0.692 ± 0.194
TrpThr: 1.177 ± 0.323
TrpVal: 1.108 ± 0.311
TrpTrp: 0.208 ± 0.119
TrpTyr: 0.346 ± 0.138
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.877 ± 0.469
TyrCys: 0.415 ± 0.131
TyrAsp: 3.046 ± 0.501
TyrGlu: 1.731 ± 0.365
TyrPhe: 1.523 ± 0.346
TyrGly: 2.908 ± 0.431
TyrHis: 0.485 ± 0.201
TyrIle: 1.523 ± 0.381
TyrLys: 1.731 ± 0.341
TyrLeu: 2.7 ± 0.446
TyrMet: 0.692 ± 0.195
TyrAsn: 1.661 ± 0.415
TyrPro: 1.385 ± 0.345
TyrGln: 1.661 ± 0.3
TyrArg: 1.523 ± 0.306
TyrSer: 2.492 ± 0.443
TyrThr: 2.561 ± 0.415
TyrVal: 2.769 ± 0.417
TyrTrp: 0.485 ± 0.17
TyrTyr: 1.385 ± 0.31
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 78 proteins (14446 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
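One way to read the note above is that whole proteins are resampled with replacement 100 times and the spread of the recomputed frequency is reported. The sketch below illustrates that idea only. The function names, the fixed seed, and the use of the sample standard deviation as the error are all assumptions, because the page does not state which statistic the ± value represents.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency of one dipeptide in per mille, pooled over proteins."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs within each protein.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the frequency over protein-level bootstrap resamples.

    Assumption: the error is the sample standard deviation of the
    estimates (requires n_rounds >= 2).
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the set size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)
```

A dipeptide that never occurs, such as any pair containing Xaa in this proteome, yields 0.0 in every resample and therefore an error of 0.0, consistent with the "0.0 ± 0.0" entries.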

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski