Amino acid dipepetide frequency for Cronobacter virus Esp2949-1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.171AlaAla: 11.171 ± 2.41
1.117AlaCys: 1.117 ± 0.296
5.586AlaAsp: 5.586 ± 0.719
6.463AlaGlu: 6.463 ± 0.589
2.474AlaPhe: 2.474 ± 0.415
6.703AlaGly: 6.703 ± 0.798
1.117AlaHis: 1.117 ± 0.388
4.788AlaIle: 4.788 ± 0.565
6.623AlaLys: 6.623 ± 1.009
7.581AlaLeu: 7.581 ± 0.625
2.873AlaMet: 2.873 ± 0.404
3.591AlaAsn: 3.591 ± 0.604
2.553AlaPro: 2.553 ± 0.397
4.788AlaGln: 4.788 ± 0.667
5.346AlaArg: 5.346 ± 0.706
7.66AlaSer: 7.66 ± 1.351
3.671AlaThr: 3.671 ± 0.684
6.384AlaVal: 6.384 ± 0.715
1.117AlaTrp: 1.117 ± 0.381
2.314AlaTyr: 2.314 ± 0.437
0.0AlaXaa: 0.0 ± 0.0
Cys
0.798CysAla: 0.798 ± 0.259
0.239CysCys: 0.239 ± 0.126
0.878CysAsp: 0.878 ± 0.263
0.718CysGlu: 0.718 ± 0.255
0.559CysPhe: 0.559 ± 0.223
1.117CysGly: 1.117 ± 0.356
0.319CysHis: 0.319 ± 0.153
0.798CysIle: 0.798 ± 0.293
0.559CysLys: 0.559 ± 0.255
0.718CysLeu: 0.718 ± 0.219
0.08CysMet: 0.08 ± 0.092
0.559CysAsn: 0.559 ± 0.195
0.479CysPro: 0.479 ± 0.204
0.319CysGln: 0.319 ± 0.158
0.798CysArg: 0.798 ± 0.296
0.559CysSer: 0.559 ± 0.18
0.878CysThr: 0.878 ± 0.252
0.798CysVal: 0.798 ± 0.228
0.319CysTrp: 0.319 ± 0.159
0.479CysTyr: 0.479 ± 0.183
0.0CysXaa: 0.0 ± 0.0
Asp
5.506AspAla: 5.506 ± 0.708
0.08AspCys: 0.08 ± 0.091
4.389AspAsp: 4.389 ± 0.806
3.591AspGlu: 3.591 ± 0.485
2.314AspPhe: 2.314 ± 0.432
7.182AspGly: 7.182 ± 0.897
1.037AspHis: 1.037 ± 0.348
2.713AspIle: 2.713 ± 0.385
3.591AspLys: 3.591 ± 0.417
4.309AspLeu: 4.309 ± 0.635
1.516AspMet: 1.516 ± 0.333
2.713AspAsn: 2.713 ± 0.433
2.234AspPro: 2.234 ± 0.335
1.835AspGln: 1.835 ± 0.431
3.112AspArg: 3.112 ± 0.644
3.99AspSer: 3.99 ± 0.565
2.873AspThr: 2.873 ± 0.582
4.07AspVal: 4.07 ± 0.477
0.718AspTrp: 0.718 ± 0.197
2.394AspTyr: 2.394 ± 0.526
0.0AspXaa: 0.0 ± 0.0
Glu
5.346GluAla: 5.346 ± 0.741
0.718GluCys: 0.718 ± 0.224
2.713GluAsp: 2.713 ± 0.556
3.83GluGlu: 3.83 ± 0.564
2.793GluPhe: 2.793 ± 0.465
3.112GluGly: 3.112 ± 0.461
1.277GluHis: 1.277 ± 0.296
5.187GluIle: 5.187 ± 0.532
3.511GluLys: 3.511 ± 0.41
5.027GluLeu: 5.027 ± 0.641
3.032GluMet: 3.032 ± 0.472
2.553GluAsn: 2.553 ± 0.451
1.835GluPro: 1.835 ± 0.439
3.591GluGln: 3.591 ± 0.655
3.032GluArg: 3.032 ± 0.489
3.91GluSer: 3.91 ± 0.595
3.591GluThr: 3.591 ± 0.856
4.389GluVal: 4.389 ± 0.555
0.399GluTrp: 0.399 ± 0.168
2.394GluTyr: 2.394 ± 0.409
0.0GluXaa: 0.0 ± 0.0
Phe
2.793PheAla: 2.793 ± 0.427
0.638PheCys: 0.638 ± 0.213
2.873PheAsp: 2.873 ± 0.595
2.075PheGlu: 2.075 ± 0.374
1.436PhePhe: 1.436 ± 0.322
3.272PheGly: 3.272 ± 0.579
0.638PheHis: 0.638 ± 0.241
1.357PheIle: 1.357 ± 0.285
2.234PheLys: 2.234 ± 0.412
2.154PheLeu: 2.154 ± 0.359
0.638PheMet: 0.638 ± 0.211
2.873PheAsn: 2.873 ± 0.441
1.357PhePro: 1.357 ± 0.383
1.117PheGln: 1.117 ± 0.343
1.915PheArg: 1.915 ± 0.353
2.314PheSer: 2.314 ± 0.408
2.633PheThr: 2.633 ± 0.399
2.713PheVal: 2.713 ± 0.406
0.718PheTrp: 0.718 ± 0.214
0.798PheTyr: 0.798 ± 0.255
0.0PheXaa: 0.0 ± 0.0
Gly
5.985GlyAla: 5.985 ± 0.69
0.718GlyCys: 0.718 ± 0.195
4.229GlyAsp: 4.229 ± 0.553
4.149GlyGlu: 4.149 ± 0.586
2.075GlyPhe: 2.075 ± 0.426
5.985GlyGly: 5.985 ± 1.119
1.117GlyHis: 1.117 ± 0.5
4.628GlyIle: 4.628 ± 0.543
6.623GlyLys: 6.623 ± 0.623
4.469GlyLeu: 4.469 ± 0.517
2.075GlyMet: 2.075 ± 0.429
3.99GlyAsn: 3.99 ± 0.57
1.277GlyPro: 1.277 ± 0.348
1.835GlyGln: 1.835 ± 0.361
4.229GlyArg: 4.229 ± 0.533
5.586GlySer: 5.586 ± 0.798
2.952GlyThr: 2.952 ± 0.644
5.825GlyVal: 5.825 ± 0.668
1.756GlyTrp: 1.756 ± 0.41
3.192GlyTyr: 3.192 ± 0.518
0.0GlyXaa: 0.0 ± 0.0
His
1.436HisAla: 1.436 ± 0.363
0.239HisCys: 0.239 ± 0.131
1.277HisAsp: 1.277 ± 0.362
0.798HisGlu: 0.798 ± 0.244
0.798HisPhe: 0.798 ± 0.249
1.436HisGly: 1.436 ± 0.428
0.479HisHis: 0.479 ± 0.252
1.037HisIle: 1.037 ± 0.271
0.878HisLys: 0.878 ± 0.241
1.676HisLeu: 1.676 ± 0.485
0.08HisMet: 0.08 ± 0.071
0.479HisAsn: 0.479 ± 0.217
1.037HisPro: 1.037 ± 0.326
0.399HisGln: 0.399 ± 0.161
1.117HisArg: 1.117 ± 0.324
0.798HisSer: 0.798 ± 0.231
0.638HisThr: 0.638 ± 0.239
1.516HisVal: 1.516 ± 0.441
0.319HisTrp: 0.319 ± 0.151
0.559HisTyr: 0.559 ± 0.214
0.0HisXaa: 0.0 ± 0.0
Ile
5.665IleAla: 5.665 ± 0.494
1.197IleCys: 1.197 ± 0.351
3.511IleAsp: 3.511 ± 0.504
3.431IleGlu: 3.431 ± 0.503
2.154IlePhe: 2.154 ± 0.39
3.671IleGly: 3.671 ± 0.467
1.197IleHis: 1.197 ± 0.289
3.192IleIle: 3.192 ± 0.433
3.83IleLys: 3.83 ± 0.448
3.032IleLeu: 3.032 ± 0.548
1.436IleMet: 1.436 ± 0.415
3.511IleAsn: 3.511 ± 0.675
2.553IlePro: 2.553 ± 0.374
2.474IleGln: 2.474 ± 0.457
3.032IleArg: 3.032 ± 0.473
4.708IleSer: 4.708 ± 0.526
4.07IleThr: 4.07 ± 0.614
4.07IleVal: 4.07 ± 0.571
0.958IleTrp: 0.958 ± 0.283
1.915IleTyr: 1.915 ± 0.277
0.0IleXaa: 0.0 ± 0.0
Lys
5.107LysAla: 5.107 ± 0.717
0.958LysCys: 0.958 ± 0.336
3.83LysAsp: 3.83 ± 0.528
4.149LysGlu: 4.149 ± 0.544
2.234LysPhe: 2.234 ± 0.352
3.83LysGly: 3.83 ± 0.401
1.197LysHis: 1.197 ± 0.326
3.351LysIle: 3.351 ± 0.565
4.469LysLys: 4.469 ± 0.789
5.187LysLeu: 5.187 ± 0.788
1.915LysMet: 1.915 ± 0.428
3.75LysAsn: 3.75 ± 0.615
2.793LysPro: 2.793 ± 0.477
2.154LysGln: 2.154 ± 0.513
2.952LysArg: 2.952 ± 0.597
3.83LysSer: 3.83 ± 0.579
4.389LysThr: 4.389 ± 0.799
3.83LysVal: 3.83 ± 0.673
1.197LysTrp: 1.197 ± 0.259
1.596LysTyr: 1.596 ± 0.421
0.0LysXaa: 0.0 ± 0.0
Leu
7.261LeuAla: 7.261 ± 0.824
0.958LeuCys: 0.958 ± 0.283
3.591LeuAsp: 3.591 ± 0.672
3.671LeuGlu: 3.671 ± 0.503
2.553LeuPhe: 2.553 ± 0.408
4.947LeuGly: 4.947 ± 0.68
0.878LeuHis: 0.878 ± 0.323
4.149LeuIle: 4.149 ± 0.484
4.548LeuLys: 4.548 ± 0.562
4.469LeuLeu: 4.469 ± 0.603
2.154LeuMet: 2.154 ± 0.444
2.793LeuAsn: 2.793 ± 0.453
3.272LeuPro: 3.272 ± 0.547
3.272LeuGln: 3.272 ± 0.605
4.149LeuArg: 4.149 ± 0.574
5.506LeuSer: 5.506 ± 0.548
4.389LeuThr: 4.389 ± 0.525
5.107LeuVal: 5.107 ± 0.594
0.638LeuTrp: 0.638 ± 0.189
2.234LeuTyr: 2.234 ± 0.356
0.0LeuXaa: 0.0 ± 0.0
Met
2.873MetAla: 2.873 ± 0.524
0.239MetCys: 0.239 ± 0.158
1.277MetAsp: 1.277 ± 0.318
1.436MetGlu: 1.436 ± 0.289
0.958MetPhe: 0.958 ± 0.267
0.958MetGly: 0.958 ± 0.281
0.559MetHis: 0.559 ± 0.231
2.075MetIle: 2.075 ± 0.399
1.915MetLys: 1.915 ± 0.457
3.032MetLeu: 3.032 ± 0.662
1.117MetMet: 1.117 ± 0.302
1.197MetAsn: 1.197 ± 0.308
0.798MetPro: 0.798 ± 0.22
1.357MetGln: 1.357 ± 0.417
2.075MetArg: 2.075 ± 0.351
1.756MetSer: 1.756 ± 0.44
1.037MetThr: 1.037 ± 0.267
1.596MetVal: 1.596 ± 0.45
0.16MetTrp: 0.16 ± 0.12
0.798MetTyr: 0.798 ± 0.25
0.0MetXaa: 0.0 ± 0.0
Asn
5.187AsnAla: 5.187 ± 1.031
0.319AsnCys: 0.319 ± 0.174
2.873AsnAsp: 2.873 ± 0.45
2.314AsnGlu: 2.314 ± 0.352
1.676AsnPhe: 1.676 ± 0.224
4.788AsnGly: 4.788 ± 0.647
0.878AsnHis: 0.878 ± 0.334
2.873AsnIle: 2.873 ± 0.511
2.154AsnLys: 2.154 ± 0.419
3.272AsnLeu: 3.272 ± 0.423
1.117AsnMet: 1.117 ± 0.316
2.234AsnAsn: 2.234 ± 0.43
2.075AsnPro: 2.075 ± 0.467
2.075AsnGln: 2.075 ± 0.394
2.713AsnArg: 2.713 ± 0.579
2.793AsnSer: 2.793 ± 0.529
2.394AsnThr: 2.394 ± 0.336
4.07AsnVal: 4.07 ± 0.581
0.798AsnTrp: 0.798 ± 0.217
1.756AsnTyr: 1.756 ± 0.374
0.0AsnXaa: 0.0 ± 0.0
Pro
3.272ProAla: 3.272 ± 0.493
0.559ProCys: 0.559 ± 0.241
3.671ProAsp: 3.671 ± 0.513
2.873ProGlu: 2.873 ± 0.724
1.596ProPhe: 1.596 ± 0.305
2.394ProGly: 2.394 ± 0.436
0.638ProHis: 0.638 ± 0.189
1.596ProIle: 1.596 ± 0.313
1.277ProLys: 1.277 ± 0.276
2.075ProLeu: 2.075 ± 0.431
1.037ProMet: 1.037 ± 0.353
1.915ProAsn: 1.915 ± 0.425
1.277ProPro: 1.277 ± 0.385
1.436ProGln: 1.436 ± 0.321
1.915ProArg: 1.915 ± 0.364
2.394ProSer: 2.394 ± 0.427
1.277ProThr: 1.277 ± 0.332
3.671ProVal: 3.671 ± 0.586
0.319ProTrp: 0.319 ± 0.152
1.516ProTyr: 1.516 ± 0.384
0.0ProXaa: 0.0 ± 0.0
Gln
3.99GlnAla: 3.99 ± 0.53
0.718GlnCys: 0.718 ± 0.301
1.995GlnAsp: 1.995 ± 0.458
2.394GlnGlu: 2.394 ± 0.476
1.197GlnPhe: 1.197 ± 0.327
2.474GlnGly: 2.474 ± 0.39
0.638GlnHis: 0.638 ± 0.252
3.351GlnIle: 3.351 ± 0.457
2.633GlnLys: 2.633 ± 0.558
2.553GlnLeu: 2.553 ± 0.483
1.037GlnMet: 1.037 ± 0.267
1.676GlnAsn: 1.676 ± 0.342
1.117GlnPro: 1.117 ± 0.407
2.952GlnGln: 2.952 ± 0.685
2.154GlnArg: 2.154 ± 0.408
3.112GlnSer: 3.112 ± 0.47
1.835GlnThr: 1.835 ± 0.371
3.671GlnVal: 3.671 ± 0.534
0.559GlnTrp: 0.559 ± 0.226
1.756GlnTyr: 1.756 ± 0.349
0.0GlnXaa: 0.0 ± 0.0
Arg
4.868ArgAla: 4.868 ± 0.507
0.718ArgCys: 0.718 ± 0.357
2.713ArgAsp: 2.713 ± 0.463
3.591ArgGlu: 3.591 ± 0.538
2.633ArgPhe: 2.633 ± 0.444
3.192ArgGly: 3.192 ± 0.479
1.277ArgHis: 1.277 ± 0.32
3.83ArgIle: 3.83 ± 0.501
4.07ArgLys: 4.07 ± 0.653
4.788ArgLeu: 4.788 ± 0.722
1.676ArgMet: 1.676 ± 0.362
2.075ArgAsn: 2.075 ± 0.391
2.713ArgPro: 2.713 ± 0.524
2.234ArgGln: 2.234 ± 0.575
4.229ArgArg: 4.229 ± 0.686
3.032ArgSer: 3.032 ± 0.448
2.394ArgThr: 2.394 ± 0.397
4.07ArgVal: 4.07 ± 0.819
0.399ArgTrp: 0.399 ± 0.189
1.676ArgTyr: 1.676 ± 0.45
0.0ArgXaa: 0.0 ± 0.0
Ser
8.618SerAla: 8.618 ± 1.55
0.638SerCys: 0.638 ± 0.218
3.75SerAsp: 3.75 ± 0.477
5.825SerGlu: 5.825 ± 0.7
2.553SerPhe: 2.553 ± 0.424
5.506SerGly: 5.506 ± 0.653
1.197SerHis: 1.197 ± 0.284
4.149SerIle: 4.149 ± 0.532
3.75SerLys: 3.75 ± 0.545
5.027SerLeu: 5.027 ± 0.622
1.596SerMet: 1.596 ± 0.388
3.272SerAsn: 3.272 ± 0.692
2.075SerPro: 2.075 ± 0.411
2.873SerGln: 2.873 ± 0.503
3.83SerArg: 3.83 ± 0.503
3.75SerSer: 3.75 ± 0.655
2.793SerThr: 2.793 ± 0.436
5.187SerVal: 5.187 ± 0.658
0.798SerTrp: 0.798 ± 0.257
2.394SerTyr: 2.394 ± 0.515
0.0SerXaa: 0.0 ± 0.0
Thr
5.027ThrAla: 5.027 ± 0.61
0.399ThrCys: 0.399 ± 0.197
3.112ThrAsp: 3.112 ± 0.421
2.873ThrGlu: 2.873 ± 0.527
2.234ThrPhe: 2.234 ± 0.425
5.027ThrGly: 5.027 ± 0.577
0.16ThrHis: 0.16 ± 0.119
3.511ThrIle: 3.511 ± 0.459
2.234ThrLys: 2.234 ± 0.503
3.272ThrLeu: 3.272 ± 0.443
0.878ThrMet: 0.878 ± 0.271
3.671ThrAsn: 3.671 ± 0.615
2.713ThrPro: 2.713 ± 0.383
2.394ThrGln: 2.394 ± 0.521
1.516ThrArg: 1.516 ± 0.252
4.947ThrSer: 4.947 ± 1.165
2.873ThrThr: 2.873 ± 0.516
3.91ThrVal: 3.91 ± 0.486
0.878ThrTrp: 0.878 ± 0.272
2.075ThrTyr: 2.075 ± 0.457
0.0ThrXaa: 0.0 ± 0.0
Val
5.665ValAla: 5.665 ± 0.865
0.798ValCys: 0.798 ± 0.33
4.788ValAsp: 4.788 ± 0.539
5.187ValGlu: 5.187 ± 0.567
2.713ValPhe: 2.713 ± 0.528
4.07ValGly: 4.07 ± 0.514
1.596ValHis: 1.596 ± 0.343
4.788ValIle: 4.788 ± 0.788
5.027ValLys: 5.027 ± 0.64
4.469ValLeu: 4.469 ± 0.479
1.516ValMet: 1.516 ± 0.392
3.351ValAsn: 3.351 ± 0.478
2.952ValPro: 2.952 ± 0.456
2.633ValGln: 2.633 ± 0.521
4.708ValArg: 4.708 ± 0.695
5.187ValSer: 5.187 ± 0.697
5.745ValThr: 5.745 ± 0.765
5.346ValVal: 5.346 ± 0.608
0.798ValTrp: 0.798 ± 0.185
1.596ValTyr: 1.596 ± 0.337
0.0ValXaa: 0.0 ± 0.0
Trp
0.718TrpAla: 0.718 ± 0.258
0.239TrpCys: 0.239 ± 0.137
0.798TrpAsp: 0.798 ± 0.206
0.878TrpGlu: 0.878 ± 0.374
0.479TrpPhe: 0.479 ± 0.177
0.878TrpGly: 0.878 ± 0.199
0.319TrpHis: 0.319 ± 0.143
0.798TrpIle: 0.798 ± 0.255
0.958TrpLys: 0.958 ± 0.236
1.596TrpLeu: 1.596 ± 0.374
0.638TrpMet: 0.638 ± 0.171
0.718TrpAsn: 0.718 ± 0.199
0.319TrpPro: 0.319 ± 0.18
0.16TrpGln: 0.16 ± 0.106
0.798TrpArg: 0.798 ± 0.245
1.197TrpSer: 1.197 ± 0.43
0.798TrpThr: 0.798 ± 0.253
0.479TrpVal: 0.479 ± 0.19
0.16TrpTrp: 0.16 ± 0.119
0.479TrpTyr: 0.479 ± 0.169
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.713TyrAla: 2.713 ± 0.408
0.479TyrCys: 0.479 ± 0.177
2.314TyrAsp: 2.314 ± 0.463
1.995TyrGlu: 1.995 ± 0.42
1.117TyrPhe: 1.117 ± 0.318
2.154TyrGly: 2.154 ± 0.389
0.479TyrHis: 0.479 ± 0.22
1.436TyrIle: 1.436 ± 0.35
1.835TyrLys: 1.835 ± 0.38
1.835TyrLeu: 1.835 ± 0.368
0.718TyrMet: 0.718 ± 0.244
1.436TyrAsn: 1.436 ± 0.329
1.277TyrPro: 1.277 ± 0.428
1.915TyrGln: 1.915 ± 0.357
2.553TyrArg: 2.553 ± 0.434
2.553TyrSer: 2.553 ± 0.473
2.474TyrThr: 2.474 ± 0.382
2.234TyrVal: 2.234 ± 0.385
0.399TyrTrp: 0.399 ± 0.159
0.798TyrTyr: 0.798 ± 0.199
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 45 proteins (12533 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski