Amino acid dipepetide frequency for Microbacterium phage Tyrumbra

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
16.919AlaAla: 16.919 ± 1.739
0.703AlaCys: 0.703 ± 0.248
7.611AlaAsp: 7.611 ± 0.921
8.021AlaGlu: 8.021 ± 0.949
3.103AlaPhe: 3.103 ± 0.455
10.187AlaGly: 10.187 ± 1.025
2.225AlaHis: 2.225 ± 0.408
5.269AlaIle: 5.269 ± 0.682
3.981AlaLys: 3.981 ± 0.526
11.358AlaLeu: 11.358 ± 1.23
3.22AlaMet: 3.22 ± 0.473
3.103AlaAsn: 3.103 ± 0.397
5.913AlaPro: 5.913 ± 0.464
3.454AlaGln: 3.454 ± 0.489
8.665AlaArg: 8.665 ± 0.806
6.557AlaSer: 6.557 ± 0.702
7.494AlaThr: 7.494 ± 0.873
7.025AlaVal: 7.025 ± 0.613
2.459AlaTrp: 2.459 ± 0.364
2.108AlaTyr: 2.108 ± 0.308
0.0AlaXaa: 0.0 ± 0.0
Cys
0.703CysAla: 0.703 ± 0.22
0.234CysCys: 0.234 ± 0.154
1.464CysAsp: 1.464 ± 0.372
0.41CysGlu: 0.41 ± 0.168
0.176CysPhe: 0.176 ± 0.097
1.288CysGly: 1.288 ± 0.311
0.351CysHis: 0.351 ± 0.135
0.293CysIle: 0.293 ± 0.133
0.0CysLys: 0.0 ± 0.0
0.468CysLeu: 0.468 ± 0.147
0.059CysMet: 0.059 ± 0.061
0.234CysAsn: 0.234 ± 0.112
0.703CysPro: 0.703 ± 0.185
0.117CysGln: 0.117 ± 0.08
0.585CysArg: 0.585 ± 0.191
0.176CysSer: 0.176 ± 0.076
0.585CysThr: 0.585 ± 0.173
0.703CysVal: 0.703 ± 0.195
0.117CysTrp: 0.117 ± 0.079
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
8.196AspAla: 8.196 ± 0.73
0.585AspCys: 0.585 ± 0.197
4.625AspAsp: 4.625 ± 0.642
4.566AspGlu: 4.566 ± 0.689
1.698AspPhe: 1.698 ± 0.319
6.674AspGly: 6.674 ± 0.819
0.995AspHis: 0.995 ± 0.243
2.986AspIle: 2.986 ± 0.388
1.288AspLys: 1.288 ± 0.284
5.269AspLeu: 5.269 ± 0.634
1.581AspMet: 1.581 ± 0.354
1.991AspAsn: 1.991 ± 0.34
3.396AspPro: 3.396 ± 0.587
2.517AspGln: 2.517 ± 0.372
4.742AspArg: 4.742 ± 0.645
3.044AspSer: 3.044 ± 0.434
3.513AspThr: 3.513 ± 0.482
4.566AspVal: 4.566 ± 0.558
1.229AspTrp: 1.229 ± 0.29
1.756AspTyr: 1.756 ± 0.376
0.0AspXaa: 0.0 ± 0.0
Glu
8.372GluAla: 8.372 ± 0.989
0.41GluCys: 0.41 ± 0.179
4.157GluAsp: 4.157 ± 0.641
4.098GluGlu: 4.098 ± 0.679
1.464GluPhe: 1.464 ± 0.288
5.035GluGly: 5.035 ± 0.718
1.873GluHis: 1.873 ± 0.399
1.171GluIle: 1.171 ± 0.364
1.991GluLys: 1.991 ± 0.415
3.513GluLeu: 3.513 ± 0.466
1.112GluMet: 1.112 ± 0.261
1.581GluAsn: 1.581 ± 0.408
4.976GluPro: 4.976 ± 0.766
1.932GluGln: 1.932 ± 0.418
4.918GluArg: 4.918 ± 0.607
3.278GluSer: 3.278 ± 0.466
3.747GluThr: 3.747 ± 0.61
5.913GluVal: 5.913 ± 0.686
1.581GluTrp: 1.581 ± 0.326
1.522GluTyr: 1.522 ± 0.337
0.0GluXaa: 0.0 ± 0.0
Phe
2.927PheAla: 2.927 ± 0.38
0.351PheCys: 0.351 ± 0.187
2.108PheAsp: 2.108 ± 0.402
1.698PheGlu: 1.698 ± 0.305
0.527PhePhe: 0.527 ± 0.195
2.342PheGly: 2.342 ± 0.343
0.41PheHis: 0.41 ± 0.163
1.581PheIle: 1.581 ± 0.304
0.527PheLys: 0.527 ± 0.191
1.756PheLeu: 1.756 ± 0.409
0.527PheMet: 0.527 ± 0.179
0.41PheAsn: 0.41 ± 0.167
0.995PhePro: 0.995 ± 0.231
0.761PheGln: 0.761 ± 0.259
1.229PheArg: 1.229 ± 0.262
1.229PheSer: 1.229 ± 0.27
2.693PheThr: 2.693 ± 0.412
2.225PheVal: 2.225 ± 0.333
0.351PheTrp: 0.351 ± 0.126
0.703PheTyr: 0.703 ± 0.223
0.0PheXaa: 0.0 ± 0.0
Gly
7.318GlyAla: 7.318 ± 0.951
0.761GlyCys: 0.761 ± 0.251
6.616GlyAsp: 6.616 ± 0.696
5.386GlyGlu: 5.386 ± 0.549
2.517GlyPhe: 2.517 ± 0.426
8.021GlyGly: 8.021 ± 0.757
1.873GlyHis: 1.873 ± 0.353
4.742GlyIle: 4.742 ± 0.63
2.225GlyLys: 2.225 ± 0.379
7.318GlyLeu: 7.318 ± 1.157
2.869GlyMet: 2.869 ± 0.482
2.225GlyAsn: 2.225 ± 0.331
3.22GlyPro: 3.22 ± 0.435
2.986GlyGln: 2.986 ± 0.401
5.21GlyArg: 5.21 ± 0.615
5.972GlySer: 5.972 ± 0.799
6.791GlyThr: 6.791 ± 0.873
6.908GlyVal: 6.908 ± 0.613
2.752GlyTrp: 2.752 ± 0.451
2.927GlyTyr: 2.927 ± 0.453
0.0GlyXaa: 0.0 ± 0.0
His
2.108HisAla: 2.108 ± 0.349
0.117HisCys: 0.117 ± 0.085
1.171HisAsp: 1.171 ± 0.263
1.347HisGlu: 1.347 ± 0.258
0.527HisPhe: 0.527 ± 0.151
1.815HisGly: 1.815 ± 0.424
0.293HisHis: 0.293 ± 0.109
0.761HisIle: 0.761 ± 0.231
0.41HisLys: 0.41 ± 0.167
2.108HisLeu: 2.108 ± 0.429
0.351HisMet: 0.351 ± 0.143
0.293HisAsn: 0.293 ± 0.143
1.522HisPro: 1.522 ± 0.313
0.41HisGln: 0.41 ± 0.177
1.522HisArg: 1.522 ± 0.345
1.054HisSer: 1.054 ± 0.238
1.347HisThr: 1.347 ± 0.309
1.229HisVal: 1.229 ± 0.294
0.176HisTrp: 0.176 ± 0.1
0.644HisTyr: 0.644 ± 0.19
0.0HisXaa: 0.0 ± 0.0
Ile
5.21IleAla: 5.21 ± 0.474
0.351IleCys: 0.351 ± 0.182
2.693IleAsp: 2.693 ± 0.455
4.215IleGlu: 4.215 ± 0.646
0.644IlePhe: 0.644 ± 0.18
3.864IleGly: 3.864 ± 0.523
0.878IleHis: 0.878 ± 0.214
2.517IleIle: 2.517 ± 0.359
1.288IleLys: 1.288 ± 0.248
2.752IleLeu: 2.752 ± 0.32
0.937IleMet: 0.937 ± 0.216
1.054IleAsn: 1.054 ± 0.288
3.103IlePro: 3.103 ± 0.694
1.229IleGln: 1.229 ± 0.313
2.869IleArg: 2.869 ± 0.481
2.635IleSer: 2.635 ± 0.444
4.391IleThr: 4.391 ± 0.588
4.157IleVal: 4.157 ± 0.505
0.878IleTrp: 0.878 ± 0.246
0.82IleTyr: 0.82 ± 0.204
0.0IleXaa: 0.0 ± 0.0
Lys
3.805LysAla: 3.805 ± 0.46
0.41LysCys: 0.41 ± 0.162
1.288LysAsp: 1.288 ± 0.255
0.703LysGlu: 0.703 ± 0.19
0.644LysPhe: 0.644 ± 0.199
2.342LysGly: 2.342 ± 0.372
0.527LysHis: 0.527 ± 0.197
1.229LysIle: 1.229 ± 0.286
0.937LysLys: 0.937 ± 0.314
0.995LysLeu: 0.995 ± 0.333
0.351LysMet: 0.351 ± 0.149
0.644LysAsn: 0.644 ± 0.204
1.288LysPro: 1.288 ± 0.325
1.112LysGln: 1.112 ± 0.282
2.81LysArg: 2.81 ± 0.522
2.108LysSer: 2.108 ± 0.38
0.761LysThr: 0.761 ± 0.227
2.752LysVal: 2.752 ± 0.46
0.293LysTrp: 0.293 ± 0.132
0.82LysTyr: 0.82 ± 0.225
0.0LysXaa: 0.0 ± 0.0
Leu
9.894LeuAla: 9.894 ± 0.909
0.468LeuCys: 0.468 ± 0.172
5.972LeuAsp: 5.972 ± 0.636
4.098LeuGlu: 4.098 ± 0.543
1.639LeuPhe: 1.639 ± 0.419
7.142LeuGly: 7.142 ± 1.112
1.229LeuHis: 1.229 ± 0.303
4.625LeuIle: 4.625 ± 0.55
1.112LeuLys: 1.112 ± 0.321
6.03LeuLeu: 6.03 ± 0.783
1.639LeuMet: 1.639 ± 0.296
1.815LeuAsn: 1.815 ± 0.3
4.566LeuPro: 4.566 ± 0.436
2.166LeuGln: 2.166 ± 0.396
5.328LeuArg: 5.328 ± 0.687
4.566LeuSer: 4.566 ± 0.628
7.26LeuThr: 7.26 ± 0.588
6.03LeuVal: 6.03 ± 0.695
1.112LeuTrp: 1.112 ± 0.223
1.464LeuTyr: 1.464 ± 0.355
0.0LeuXaa: 0.0 ± 0.0
Met
2.576MetAla: 2.576 ± 0.508
0.176MetCys: 0.176 ± 0.119
1.171MetAsp: 1.171 ± 0.238
0.761MetGlu: 0.761 ± 0.178
0.527MetPhe: 0.527 ± 0.19
1.932MetGly: 1.932 ± 0.363
0.41MetHis: 0.41 ± 0.145
1.347MetIle: 1.347 ± 0.307
0.468MetLys: 0.468 ± 0.162
1.522MetLeu: 1.522 ± 0.316
0.234MetMet: 0.234 ± 0.112
0.761MetAsn: 0.761 ± 0.202
1.464MetPro: 1.464 ± 0.407
0.82MetGln: 0.82 ± 0.234
1.171MetArg: 1.171 ± 0.245
2.927MetSer: 2.927 ± 0.461
2.752MetThr: 2.752 ± 0.377
1.229MetVal: 1.229 ± 0.255
0.41MetTrp: 0.41 ± 0.149
0.703MetTyr: 0.703 ± 0.189
0.0MetXaa: 0.0 ± 0.0
Asn
3.396AsnAla: 3.396 ± 0.397
0.351AsnCys: 0.351 ± 0.152
1.054AsnAsp: 1.054 ± 0.206
1.054AsnGlu: 1.054 ± 0.323
0.41AsnPhe: 0.41 ± 0.161
3.044AsnGly: 3.044 ± 0.436
0.527AsnHis: 0.527 ± 0.183
0.82AsnIle: 0.82 ± 0.231
0.527AsnLys: 0.527 ± 0.16
3.337AsnLeu: 3.337 ± 0.377
0.351AsnMet: 0.351 ± 0.189
0.293AsnAsn: 0.293 ± 0.133
2.225AsnPro: 2.225 ± 0.397
0.82AsnGln: 0.82 ± 0.226
2.108AsnArg: 2.108 ± 0.388
1.698AsnSer: 1.698 ± 0.319
1.815AsnThr: 1.815 ± 0.313
1.581AsnVal: 1.581 ± 0.254
0.176AsnTrp: 0.176 ± 0.108
0.995AsnTyr: 0.995 ± 0.259
0.0AsnXaa: 0.0 ± 0.0
Pro
6.03ProAla: 6.03 ± 0.562
0.644ProCys: 0.644 ± 0.214
3.747ProAsp: 3.747 ± 0.689
5.269ProGlu: 5.269 ± 0.753
1.522ProPhe: 1.522 ± 0.306
5.269ProGly: 5.269 ± 0.599
1.347ProHis: 1.347 ± 0.278
1.756ProIle: 1.756 ± 0.324
1.522ProLys: 1.522 ± 0.316
3.513ProLeu: 3.513 ± 0.41
1.581ProMet: 1.581 ± 0.34
1.991ProAsn: 1.991 ± 0.432
3.805ProPro: 3.805 ± 0.471
2.166ProGln: 2.166 ± 0.534
3.161ProArg: 3.161 ± 0.591
2.927ProSer: 2.927 ± 0.414
3.922ProThr: 3.922 ± 0.763
4.918ProVal: 4.918 ± 0.626
0.878ProTrp: 0.878 ± 0.22
1.522ProTyr: 1.522 ± 0.279
0.0ProXaa: 0.0 ± 0.0
Gln
4.215GlnAla: 4.215 ± 0.705
0.293GlnCys: 0.293 ± 0.129
1.522GlnAsp: 1.522 ± 0.272
1.639GlnGlu: 1.639 ± 0.288
1.347GlnPhe: 1.347 ± 0.308
2.342GlnGly: 2.342 ± 0.433
0.995GlnHis: 0.995 ± 0.284
1.698GlnIle: 1.698 ± 0.353
0.644GlnLys: 0.644 ± 0.24
1.054GlnLeu: 1.054 ± 0.519
0.995GlnMet: 0.995 ± 0.208
1.171GlnAsn: 1.171 ± 0.213
1.873GlnPro: 1.873 ± 0.312
1.112GlnGln: 1.112 ± 0.303
2.635GlnArg: 2.635 ± 0.417
1.756GlnSer: 1.756 ± 0.406
1.698GlnThr: 1.698 ± 0.329
2.752GlnVal: 2.752 ± 0.371
0.644GlnTrp: 0.644 ± 0.219
0.995GlnTyr: 0.995 ± 0.222
0.0GlnXaa: 0.0 ± 0.0
Arg
8.43ArgAla: 8.43 ± 0.818
0.761ArgCys: 0.761 ± 0.18
4.859ArgAsp: 4.859 ± 0.648
4.098ArgGlu: 4.098 ± 0.653
1.873ArgPhe: 1.873 ± 0.362
5.562ArgGly: 5.562 ± 0.679
1.347ArgHis: 1.347 ± 0.288
3.454ArgIle: 3.454 ± 0.553
2.166ArgLys: 2.166 ± 0.408
6.733ArgLeu: 6.733 ± 0.643
1.756ArgMet: 1.756 ± 0.336
1.639ArgAsn: 1.639 ± 0.386
2.869ArgPro: 2.869 ± 0.507
2.4ArgGln: 2.4 ± 0.404
6.206ArgArg: 6.206 ± 0.913
3.63ArgSer: 3.63 ± 0.483
4.449ArgThr: 4.449 ± 0.69
4.918ArgVal: 4.918 ± 0.61
1.698ArgTrp: 1.698 ± 0.338
2.108ArgTyr: 2.108 ± 0.327
0.0ArgXaa: 0.0 ± 0.0
Ser
7.025SerAla: 7.025 ± 1.015
0.234SerCys: 0.234 ± 0.168
3.981SerAsp: 3.981 ± 0.508
3.161SerGlu: 3.161 ± 0.462
2.225SerPhe: 2.225 ± 0.417
5.679SerGly: 5.679 ± 0.775
0.937SerHis: 0.937 ± 0.242
3.278SerIle: 3.278 ± 0.45
1.464SerLys: 1.464 ± 0.295
4.157SerLeu: 4.157 ± 0.7
1.639SerMet: 1.639 ± 0.261
2.049SerAsn: 2.049 ± 0.356
3.161SerPro: 3.161 ± 0.439
1.464SerGln: 1.464 ± 0.284
3.571SerArg: 3.571 ± 0.388
2.927SerSer: 2.927 ± 0.624
4.566SerThr: 4.566 ± 0.603
4.508SerVal: 4.508 ± 0.469
0.878SerTrp: 0.878 ± 0.217
0.995SerTyr: 0.995 ± 0.275
0.0SerXaa: 0.0 ± 0.0
Thr
8.899ThrAla: 8.899 ± 0.946
0.527ThrCys: 0.527 ± 0.161
3.922ThrAsp: 3.922 ± 0.55
4.215ThrGlu: 4.215 ± 0.7
1.698ThrPhe: 1.698 ± 0.349
6.908ThrGly: 6.908 ± 0.788
0.878ThrHis: 0.878 ± 0.223
3.688ThrIle: 3.688 ± 0.336
1.815ThrLys: 1.815 ± 0.376
6.674ThrLeu: 6.674 ± 0.665
1.171ThrMet: 1.171 ± 0.223
1.932ThrAsn: 1.932 ± 0.351
5.093ThrPro: 5.093 ± 0.677
1.815ThrGln: 1.815 ± 0.288
4.449ThrArg: 4.449 ± 0.528
3.922ThrSer: 3.922 ± 0.617
4.332ThrThr: 4.332 ± 0.766
6.616ThrVal: 6.616 ± 0.629
1.229ThrTrp: 1.229 ± 0.352
1.581ThrTyr: 1.581 ± 0.367
0.0ThrXaa: 0.0 ± 0.0
Val
8.313ValAla: 8.313 ± 0.827
0.82ValCys: 0.82 ± 0.258
3.922ValAsp: 3.922 ± 0.475
5.093ValGlu: 5.093 ± 0.5
1.639ValPhe: 1.639 ± 0.356
5.679ValGly: 5.679 ± 0.679
1.464ValHis: 1.464 ± 0.282
4.098ValIle: 4.098 ± 0.493
2.166ValLys: 2.166 ± 0.287
6.147ValLeu: 6.147 ± 0.567
1.873ValMet: 1.873 ± 0.374
2.049ValAsn: 2.049 ± 0.318
5.386ValPro: 5.386 ± 0.591
2.576ValGln: 2.576 ± 0.436
5.562ValArg: 5.562 ± 0.607
4.859ValSer: 4.859 ± 0.484
6.791ValThr: 6.791 ± 0.711
5.913ValVal: 5.913 ± 0.599
1.698ValTrp: 1.698 ± 0.334
1.347ValTyr: 1.347 ± 0.317
0.0ValXaa: 0.0 ± 0.0
Trp
1.932TrpAla: 1.932 ± 0.301
0.351TrpCys: 0.351 ± 0.145
0.937TrpAsp: 0.937 ± 0.252
1.229TrpGlu: 1.229 ± 0.273
0.644TrpPhe: 0.644 ± 0.208
1.405TrpGly: 1.405 ± 0.275
0.293TrpHis: 0.293 ± 0.125
0.703TrpIle: 0.703 ± 0.22
0.878TrpLys: 0.878 ± 0.227
1.347TrpLeu: 1.347 ± 0.297
0.293TrpMet: 0.293 ± 0.122
0.82TrpAsn: 0.82 ± 0.236
0.937TrpPro: 0.937 ± 0.247
0.937TrpGln: 0.937 ± 0.221
1.756TrpArg: 1.756 ± 0.327
1.464TrpSer: 1.464 ± 0.32
0.878TrpThr: 0.878 ± 0.222
1.522TrpVal: 1.522 ± 0.26
0.703TrpTrp: 0.703 ± 0.253
0.82TrpTyr: 0.82 ± 0.253
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.103TyrAla: 3.103 ± 0.53
0.117TyrCys: 0.117 ± 0.083
2.4TyrAsp: 2.4 ± 0.415
1.581TyrGlu: 1.581 ± 0.309
0.527TyrPhe: 0.527 ± 0.184
2.108TyrGly: 2.108 ± 0.426
0.351TyrHis: 0.351 ± 0.13
0.468TyrIle: 0.468 ± 0.156
0.41TyrLys: 0.41 ± 0.156
2.108TyrLeu: 2.108 ± 0.361
0.761TyrMet: 0.761 ± 0.226
0.527TyrAsn: 0.527 ± 0.172
0.995TyrPro: 0.995 ± 0.264
0.585TyrGln: 0.585 ± 0.175
2.459TyrArg: 2.459 ± 0.457
1.171TyrSer: 1.171 ± 0.283
1.581TyrThr: 1.581 ± 0.322
1.991TyrVal: 1.991 ± 0.355
0.585TyrTrp: 0.585 ± 0.178
0.351TyrTyr: 0.351 ± 0.131
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 91 proteins (17082 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski