Amino acid dipeptide frequency for Leptospira phage LE4

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
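The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the code used to build the table: it assumes one-letter amino acid codes (the table uses three-letter codes), counts overlapping pairs within each protein, and divides by the total number of pairs counted. The function names, the toy sequences and the choice of denominator are assumptions made for the example.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code; anything else becomes X (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"

# Uniform expectation over the 400 standard dipeptides, in per mille.
UNIFORM_EXPECTATION = 1000.0 / 400  # 2.5


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs.

    Pairs are counted with a sliding window inside each protein, so no
    pair spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X (assumption of this sketch).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def label(freq):
    """Classify a per-mille frequency against the uniform expectation."""
    if freq > UNIFORM_EXPECTATION:
        return "over"
    if freq < UNIFORM_EXPECTATION:
        return "under"
    return "expected"


# Toy input (hypothetical sequences, not taken from the proteome).
toy = ["MKKLEK", "MKEL"]
freqs = dipeptide_per_mille(toy)
```

With the values from the table, `label(9.854)` (LysLys) gives `"over"` and `label(0.322)` (AlaAla) gives `"under"`.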

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
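As a worked example of the unit conversion, take the LysLys value from the table (9.854‰) and compare it with the uniform expectation of one in 400:

```latex
\[
9.854\,\text{\textperthousand} = 9.854 \times 10^{-3} = 0.009854 = 0.9854\%
\]
\[
\frac{1}{400} = 0.0025 = 0.25\% = 2.5\,\text{\textperthousand}
\]
```

In other words, a table value is above the uniform expectation when it exceeds 2.5.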

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.322 ± 0.157
AlaCys: 0.129 ± 0.09
AlaAsp: 2.576 ± 0.404
AlaGlu: 3.349 ± 0.522
AlaPhe: 3.285 ± 0.442
AlaGly: 2.963 ± 0.587
AlaHis: 0.902 ± 0.201
AlaIle: 4.766 ± 0.601
AlaLys: 5.153 ± 0.818
AlaLeu: 4.251 ± 0.504
AlaMet: 1.739 ± 0.328
AlaAsn: 2.77 ± 0.343
AlaPro: 2.061 ± 0.367
AlaGln: 1.481 ± 0.381
AlaArg: 3.156 ± 0.378
AlaSer: 4.122 ± 0.63
AlaThr: 3.285 ± 0.55
AlaVal: 3.285 ± 0.515
AlaTrp: 0.708 ± 0.181
AlaTyr: 2.254 ± 0.358
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.322 ± 0.173
CysCys: 0.193 ± 0.102
CysAsp: 0.386 ± 0.141
CysGlu: 0.773 ± 0.237
CysPhe: 0.451 ± 0.154
CysGly: 0.902 ± 0.295
CysHis: 0.322 ± 0.154
CysIle: 0.58 ± 0.195
CysLys: 0.837 ± 0.248
CysLeu: 0.708 ± 0.27
CysMet: 0.258 ± 0.131
CysAsn: 0.258 ± 0.155
CysPro: 0.515 ± 0.153
CysGln: 0.386 ± 0.15
CysArg: 0.386 ± 0.202
CysSer: 0.515 ± 0.19
CysThr: 0.386 ± 0.139
CysVal: 0.451 ± 0.161
CysTrp: 0.064 ± 0.071
CysTyr: 0.451 ± 0.163
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.898 ± 0.432
AspCys: 0.708 ± 0.233
AspAsp: 2.125 ± 0.44
AspGlu: 4.895 ± 0.482
AspPhe: 4.058 ± 0.497
AspGly: 3.349 ± 0.605
AspHis: 0.837 ± 0.253
AspIle: 3.8 ± 0.604
AspLys: 5.088 ± 0.442
AspLeu: 5.217 ± 0.599
AspMet: 0.902 ± 0.223
AspAsn: 2.19 ± 0.315
AspPro: 2.448 ± 0.365
AspGln: 2.061 ± 0.376
AspArg: 1.546 ± 0.383
AspSer: 3.864 ± 0.437
AspThr: 2.19 ± 0.372
AspVal: 2.383 ± 0.456
AspTrp: 0.902 ± 0.189
AspTyr: 1.803 ± 0.328
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.766 ± 0.722
GluCys: 0.902 ± 0.251
GluAsp: 4.315 ± 0.605
GluGlu: 6.376 ± 1.009
GluPhe: 3.607 ± 0.492
GluGly: 3.478 ± 0.44
GluHis: 0.773 ± 0.242
GluIle: 7.471 ± 0.795
GluLys: 9.404 ± 1.214
GluLeu: 6.376 ± 0.727
GluMet: 1.932 ± 0.389
GluAsn: 5.797 ± 0.647
GluPro: 2.448 ± 0.547
GluGln: 2.898 ± 0.548
GluArg: 3.092 ± 0.388
GluSer: 5.346 ± 0.646
GluThr: 4.187 ± 0.495
GluVal: 4.637 ± 0.748
GluTrp: 1.031 ± 0.303
GluTyr: 2.576 ± 0.395
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.125 ± 0.368
PheCys: 0.837 ± 0.251
PheAsp: 2.641 ± 0.388
PheGlu: 3.929 ± 0.476
PhePhe: 1.481 ± 0.298
PheGly: 3.414 ± 0.496
PheHis: 1.224 ± 0.313
PheIle: 3.22 ± 0.466
PheLys: 3.993 ± 0.515
PheLeu: 3.285 ± 0.51
PheMet: 0.773 ± 0.286
PheAsn: 2.319 ± 0.407
PhePro: 1.61 ± 0.313
PheGln: 2.254 ± 0.324
PheArg: 2.254 ± 0.42
PheSer: 3.542 ± 0.492
PheThr: 3.092 ± 0.385
PheVal: 2.898 ± 0.516
PheTrp: 0.515 ± 0.173
PheTyr: 2.319 ± 0.44
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.671 ± 0.746
GlyCys: 0.58 ± 0.206
GlyAsp: 2.319 ± 0.417
GlyGlu: 4.058 ± 0.57
GlyPhe: 3.22 ± 0.606
GlyGly: 4.315 ± 0.742
GlyHis: 1.353 ± 0.302
GlyIle: 4.315 ± 0.54
GlyLys: 4.702 ± 0.592
GlyLeu: 4.766 ± 0.708
GlyMet: 1.675 ± 0.319
GlyAsn: 3.414 ± 0.476
GlyPro: 0.708 ± 0.307
GlyGln: 2.319 ± 0.397
GlyArg: 2.254 ± 0.402
GlySer: 4.831 ± 0.919
GlyThr: 4.831 ± 0.67
GlyVal: 5.024 ± 0.647
GlyTrp: 0.708 ± 0.211
GlyTyr: 2.512 ± 0.448
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.837 ± 0.202
HisCys: 0.258 ± 0.134
HisAsp: 0.837 ± 0.313
HisGlu: 1.159 ± 0.269
HisPhe: 0.773 ± 0.214
HisGly: 0.773 ± 0.227
HisHis: 0.322 ± 0.144
HisIle: 0.966 ± 0.279
HisLys: 1.031 ± 0.226
HisLeu: 2.061 ± 0.529
HisMet: 0.193 ± 0.103
HisAsn: 0.451 ± 0.158
HisPro: 0.902 ± 0.266
HisGln: 0.258 ± 0.152
HisArg: 0.451 ± 0.158
HisSer: 1.417 ± 0.319
HisThr: 0.644 ± 0.198
HisVal: 1.159 ± 0.338
HisTrp: 0.386 ± 0.2
HisTyr: 1.031 ± 0.222
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.864 ± 0.408
IleCys: 0.322 ± 0.146
IleAsp: 4.122 ± 0.587
IleGlu: 6.57 ± 0.627
IlePhe: 3.092 ± 0.501
IleGly: 4.766 ± 0.634
IleHis: 1.095 ± 0.247
IleIle: 4.187 ± 0.534
IleLys: 7.407 ± 0.814
IleLeu: 6.441 ± 0.78
IleMet: 0.966 ± 0.285
IleAsn: 3.285 ± 0.503
IlePro: 3.22 ± 0.488
IleGln: 5.41 ± 0.585
IleArg: 3.156 ± 0.528
IleSer: 5.281 ± 0.589
IleThr: 3.542 ± 0.48
IleVal: 4.509 ± 0.49
IleTrp: 0.966 ± 0.251
IleTyr: 3.285 ± 0.584
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.346 ± 0.814
LysCys: 0.966 ± 0.331
LysAsp: 4.895 ± 0.679
LysGlu: 9.726 ± 1.123
LysPhe: 4.187 ± 0.492
LysGly: 5.153 ± 0.742
LysHis: 1.675 ± 0.334
LysIle: 8.566 ± 0.801
LysLys: 9.854 ± 1.254
LysLeu: 5.604 ± 0.567
LysMet: 3.156 ± 0.47
LysAsn: 5.732 ± 0.634
LysPro: 3.478 ± 0.444
LysGln: 3.929 ± 0.703
LysArg: 3.478 ± 0.413
LysSer: 5.668 ± 0.728
LysThr: 6.312 ± 0.649
LysVal: 5.475 ± 0.45
LysTrp: 0.837 ± 0.266
LysTyr: 3.156 ± 0.532
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 2.963 ± 0.441
LeuCys: 0.773 ± 0.262
LeuAsp: 5.281 ± 0.686
LeuGlu: 7.085 ± 0.661
LeuPhe: 3.8 ± 0.548
LeuGly: 4.895 ± 0.635
LeuHis: 0.773 ± 0.216
LeuIle: 5.153 ± 0.665
LeuLys: 7.343 ± 0.72
LeuLeu: 5.861 ± 0.656
LeuMet: 1.224 ± 0.308
LeuAsn: 4.959 ± 0.571
LeuPro: 2.898 ± 0.434
LeuGln: 3.478 ± 0.52
LeuArg: 3.156 ± 0.424
LeuSer: 5.926 ± 0.667
LeuThr: 4.058 ± 0.523
LeuVal: 4.058 ± 0.39
LeuTrp: 0.966 ± 0.227
LeuTyr: 2.319 ± 0.354
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.417 ± 0.331
MetCys: 0.129 ± 0.086
MetAsp: 1.353 ± 0.338
MetGlu: 1.868 ± 0.365
MetPhe: 1.095 ± 0.242
MetGly: 1.095 ± 0.258
MetHis: 0.451 ± 0.185
MetIle: 2.061 ± 0.378
MetLys: 3.092 ± 0.424
MetLeu: 1.288 ± 0.388
MetMet: 0.966 ± 0.338
MetAsn: 1.868 ± 0.354
MetPro: 0.902 ± 0.281
MetGln: 0.708 ± 0.23
MetArg: 1.095 ± 0.252
MetSer: 1.353 ± 0.3
MetThr: 0.966 ± 0.312
MetVal: 0.837 ± 0.201
MetTrp: 0.0 ± 0.0
MetTyr: 0.58 ± 0.159
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.414 ± 0.442
AsnCys: 0.193 ± 0.118
AsnAsp: 2.898 ± 0.436
AsnGlu: 3.929 ± 0.464
AsnPhe: 2.705 ± 0.428
AsnGly: 3.607 ± 0.553
AsnHis: 1.224 ± 0.28
AsnIle: 3.285 ± 0.467
AsnLys: 4.444 ± 0.569
AsnLeu: 5.153 ± 0.516
AsnMet: 1.288 ± 0.312
AsnAsn: 2.512 ± 0.397
AsnPro: 2.834 ± 0.455
AsnGln: 2.705 ± 0.359
AsnArg: 2.898 ± 0.433
AsnSer: 3.607 ± 0.515
AsnThr: 1.868 ± 0.443
AsnVal: 2.705 ± 0.424
AsnTrp: 0.193 ± 0.102
AsnTyr: 1.739 ± 0.376
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.641 ± 0.422
ProCys: 0.258 ± 0.118
ProAsp: 1.932 ± 0.315
ProGlu: 4.251 ± 0.597
ProPhe: 1.546 ± 0.281
ProGly: 2.254 ± 0.315
ProHis: 0.193 ± 0.154
ProIle: 3.027 ± 0.384
ProLys: 3.414 ± 0.621
ProLeu: 2.576 ± 0.341
ProMet: 0.58 ± 0.169
ProAsn: 1.739 ± 0.348
ProPro: 1.224 ± 0.324
ProGln: 1.417 ± 0.297
ProArg: 1.224 ± 0.262
ProSer: 2.512 ± 0.364
ProThr: 2.512 ± 0.469
ProVal: 1.868 ± 0.297
ProTrp: 0.322 ± 0.146
ProTyr: 1.546 ± 0.256
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.319 ± 0.372
GlnCys: 0.58 ± 0.18
GlnAsp: 1.739 ± 0.314
GlnGlu: 2.834 ± 0.423
GlnPhe: 2.061 ± 0.29
GlnGly: 2.834 ± 0.43
GlnHis: 0.129 ± 0.084
GlnIle: 3.864 ± 0.457
GlnLys: 5.088 ± 0.669
GlnLeu: 3.156 ± 0.441
GlnMet: 0.708 ± 0.212
GlnAsn: 2.834 ± 0.398
GlnPro: 1.481 ± 0.295
GlnGln: 1.159 ± 0.281
GlnArg: 1.997 ± 0.451
GlnSer: 2.319 ± 0.357
GlnThr: 2.641 ± 0.386
GlnVal: 1.932 ± 0.337
GlnTrp: 0.258 ± 0.133
GlnTyr: 1.224 ± 0.263
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.448 ± 0.503
ArgCys: 0.451 ± 0.172
ArgAsp: 1.997 ± 0.432
ArgGlu: 3.736 ± 0.539
ArgPhe: 2.061 ± 0.407
ArgGly: 1.932 ± 0.379
ArgHis: 0.837 ± 0.254
ArgIle: 3.22 ± 0.53
ArgLys: 4.573 ± 0.487
ArgLeu: 2.963 ± 0.402
ArgMet: 1.031 ± 0.269
ArgAsn: 1.868 ± 0.423
ArgPro: 1.61 ± 0.273
ArgGln: 1.481 ± 0.351
ArgArg: 1.997 ± 0.298
ArgSer: 2.77 ± 0.405
ArgThr: 1.932 ± 0.365
ArgVal: 2.19 ± 0.427
ArgTrp: 0.451 ± 0.168
ArgTyr: 1.481 ± 0.277
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.058 ± 0.553
SerCys: 0.193 ± 0.103
SerAsp: 3.671 ± 0.466
SerGlu: 5.088 ± 0.578
SerPhe: 3.542 ± 0.475
SerGly: 5.797 ± 0.735
SerHis: 0.773 ± 0.225
SerIle: 6.054 ± 0.644
SerLys: 6.763 ± 0.636
SerLeu: 4.444 ± 0.505
SerMet: 1.353 ± 0.346
SerAsn: 3.478 ± 0.436
SerPro: 2.061 ± 0.395
SerGln: 3.092 ± 0.513
SerArg: 2.576 ± 0.398
SerSer: 4.509 ± 0.603
SerThr: 4.251 ± 0.789
SerVal: 4.573 ± 0.642
SerTrp: 0.966 ± 0.245
SerTyr: 2.705 ± 0.358
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.671 ± 0.524
ThrCys: 0.386 ± 0.157
ThrAsp: 3.092 ± 0.429
ThrGlu: 4.444 ± 0.768
ThrPhe: 2.383 ± 0.368
ThrGly: 4.122 ± 0.86
ThrHis: 0.773 ± 0.237
ThrIle: 4.122 ± 0.551
ThrLys: 5.475 ± 0.663
ThrLeu: 4.38 ± 0.597
ThrMet: 1.417 ± 0.309
ThrAsn: 2.77 ± 0.455
ThrPro: 2.898 ± 0.412
ThrGln: 1.481 ± 0.27
ThrArg: 1.61 ± 0.336
ThrSer: 4.251 ± 0.703
ThrThr: 3.542 ± 0.651
ThrVal: 3.993 ± 0.587
ThrTrp: 0.322 ± 0.124
ThrTyr: 1.932 ± 0.354
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.77 ± 0.472
ValCys: 0.515 ± 0.209
ValAsp: 3.542 ± 0.452
ValGlu: 3.993 ± 0.457
ValPhe: 2.254 ± 0.379
ValGly: 3.607 ± 0.59
ValHis: 1.031 ± 0.261
ValIle: 3.285 ± 0.407
ValLys: 5.41 ± 0.697
ValLeu: 4.187 ± 0.399
ValMet: 1.739 ± 0.457
ValAsn: 2.963 ± 0.519
ValPro: 2.383 ± 0.416
ValGln: 2.705 ± 0.518
ValArg: 2.834 ± 0.486
ValSer: 4.895 ± 0.676
ValThr: 3.929 ± 0.552
ValVal: 4.058 ± 0.446
ValTrp: 0.708 ± 0.257
ValTyr: 1.932 ± 0.347
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.515 ± 0.195
TrpCys: 0.129 ± 0.086
TrpAsp: 0.773 ± 0.186
TrpGlu: 0.515 ± 0.163
TrpPhe: 0.129 ± 0.085
TrpGly: 0.515 ± 0.221
TrpHis: 0.258 ± 0.132
TrpIle: 1.353 ± 0.337
TrpLys: 0.902 ± 0.24
TrpLeu: 1.353 ± 0.343
TrpMet: 0.193 ± 0.108
TrpAsn: 1.031 ± 0.226
TrpPro: 0.193 ± 0.124
TrpGln: 0.258 ± 0.138
TrpArg: 0.451 ± 0.2
TrpSer: 0.451 ± 0.16
TrpThr: 0.515 ± 0.156
TrpVal: 0.773 ± 0.267
TrpTrp: 0.129 ± 0.096
TrpTyr: 0.258 ± 0.122
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.19 ± 0.477
TyrCys: 0.644 ± 0.209
TyrAsp: 2.77 ± 0.379
TyrGlu: 2.834 ± 0.372
TyrPhe: 1.997 ± 0.414
TyrGly: 1.803 ± 0.326
TyrHis: 0.966 ± 0.276
TyrIle: 2.19 ± 0.398
TyrLys: 3.22 ± 0.432
TyrLeu: 2.963 ± 0.475
TyrMet: 0.966 ± 0.256
TyrAsn: 0.966 ± 0.217
TyrPro: 1.159 ± 0.272
TyrGln: 1.675 ± 0.337
TyrArg: 1.417 ± 0.286
TyrSer: 2.77 ± 0.409
TyrThr: 2.383 ± 0.427
TyrVal: 1.932 ± 0.352
TyrTrp: 0.258 ± 0.116
TyrTyr: 1.675 ± 0.357
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 81 proteins (15527 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
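A protein-level bootstrap of this kind might look like the sketch below. It is an illustration under assumptions, not the database's actual procedure: whole proteins are resampled with replacement, the per-mille frequency of one dipeptide is recomputed for each replicate, and the standard deviation of the replicates is reported as the error. The function names, the one-letter codes, the fixed seed and the use of the standard deviation are all choices made for the example.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Per-mille frequency of one dipeptide, counted within each protein."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Each round draws len(proteins) whole proteins with replacement, so the
    unit of resampling is the protein rather than the residue or the pair.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    # Sample standard deviation across replicates (an assumption of this sketch).
    return statistics.stdev(estimates)


# Toy input (hypothetical sequences, not taken from the proteome).
toy = ["MKKLEK", "MKEL", "AKKA", "GGKK"]
kk_error = bootstrap_error(toy, "KK")
```

Resampling whole proteins keeps the pairs inside each sequence together, so the estimate reflects variation between proteins rather than treating every dipeptide as an independent observation.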

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski