Amino acid dipepetide frequency for Microbacterium phage RubyRalph

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
15.005AlaAla: 15.005 ± 1.57
0.766AlaCys: 0.766 ± 0.249
7.655AlaAsp: 7.655 ± 0.502
7.043AlaGlu: 7.043 ± 0.692
2.909AlaPhe: 2.909 ± 0.362
8.727AlaGly: 8.727 ± 0.801
1.735AlaHis: 1.735 ± 0.345
5.767AlaIle: 5.767 ± 0.607
4.134AlaLys: 4.134 ± 0.428
10.309AlaLeu: 10.309 ± 1.171
2.144AlaMet: 2.144 ± 0.356
3.419AlaAsn: 3.419 ± 0.413
7.196AlaPro: 7.196 ± 0.701
4.287AlaGln: 4.287 ± 0.518
7.809AlaArg: 7.809 ± 0.579
6.277AlaSer: 6.277 ± 0.657
7.247AlaThr: 7.247 ± 0.776
9.493AlaVal: 9.493 ± 0.747
2.041AlaTrp: 2.041 ± 0.324
2.96AlaTyr: 2.96 ± 0.448
0.0AlaXaa: 0.0 ± 0.0
Cys
1.021CysAla: 1.021 ± 0.218
0.153CysCys: 0.153 ± 0.084
0.357CysAsp: 0.357 ± 0.146
0.561CysGlu: 0.561 ± 0.196
0.306CysPhe: 0.306 ± 0.139
1.531CysGly: 1.531 ± 0.33
0.153CysHis: 0.153 ± 0.091
0.357CysIle: 0.357 ± 0.141
0.051CysLys: 0.051 ± 0.053
0.306CysLeu: 0.306 ± 0.139
0.102CysMet: 0.102 ± 0.069
0.102CysAsn: 0.102 ± 0.064
1.021CysPro: 1.021 ± 0.218
0.204CysGln: 0.204 ± 0.089
0.715CysArg: 0.715 ± 0.223
0.459CysSer: 0.459 ± 0.146
0.255CysThr: 0.255 ± 0.112
0.51CysVal: 0.51 ± 0.173
0.204CysTrp: 0.204 ± 0.091
0.255CysTyr: 0.255 ± 0.111
0.0CysXaa: 0.0 ± 0.0
Asp
6.124AspAla: 6.124 ± 0.517
0.612AspCys: 0.612 ± 0.167
3.215AspAsp: 3.215 ± 0.373
4.848AspGlu: 4.848 ± 0.807
1.48AspPhe: 1.48 ± 0.24
6.584AspGly: 6.584 ± 0.679
1.072AspHis: 1.072 ± 0.288
2.603AspIle: 2.603 ± 0.366
1.48AspLys: 1.48 ± 0.317
5.767AspLeu: 5.767 ± 0.567
1.786AspMet: 1.786 ± 0.225
1.48AspAsn: 1.48 ± 0.243
4.287AspPro: 4.287 ± 0.453
2.297AspGln: 2.297 ± 0.277
4.032AspArg: 4.032 ± 0.588
2.705AspSer: 2.705 ± 0.366
3.062AspThr: 3.062 ± 0.417
4.44AspVal: 4.44 ± 0.426
1.48AspTrp: 1.48 ± 0.372
1.174AspTyr: 1.174 ± 0.252
0.0AspXaa: 0.0 ± 0.0
Glu
7.349GluAla: 7.349 ± 0.708
0.612GluCys: 0.612 ± 0.187
3.879GluAsp: 3.879 ± 0.554
4.797GluGlu: 4.797 ± 0.677
1.837GluPhe: 1.837 ± 0.269
5.41GluGly: 5.41 ± 0.73
1.939GluHis: 1.939 ± 0.357
3.828GluIle: 3.828 ± 0.436
2.654GluLys: 2.654 ± 0.529
6.277GluLeu: 6.277 ± 0.498
2.092GluMet: 2.092 ± 0.318
2.348GluAsn: 2.348 ± 0.323
3.624GluPro: 3.624 ± 0.501
4.083GluGln: 4.083 ± 0.486
6.124GluArg: 6.124 ± 0.565
2.909GluSer: 2.909 ± 0.412
3.113GluThr: 3.113 ± 0.292
4.032GluVal: 4.032 ± 0.466
1.582GluTrp: 1.582 ± 0.285
1.939GluTyr: 1.939 ± 0.321
0.0GluXaa: 0.0 ± 0.0
Phe
3.419PheAla: 3.419 ± 0.452
0.306PheCys: 0.306 ± 0.12
2.297PheAsp: 2.297 ± 0.399
1.837PheGlu: 1.837 ± 0.273
0.715PhePhe: 0.715 ± 0.223
2.96PheGly: 2.96 ± 0.455
0.306PheHis: 0.306 ± 0.12
0.919PheIle: 0.919 ± 0.176
0.561PheLys: 0.561 ± 0.186
2.501PheLeu: 2.501 ± 0.379
0.51PheMet: 0.51 ± 0.15
0.561PheAsn: 0.561 ± 0.158
1.531PhePro: 1.531 ± 0.27
0.715PheGln: 0.715 ± 0.167
2.041PheArg: 2.041 ± 0.348
1.531PheSer: 1.531 ± 0.278
1.735PheThr: 1.735 ± 0.311
1.735PheVal: 1.735 ± 0.285
0.561PheTrp: 0.561 ± 0.169
0.97PheTyr: 0.97 ± 0.191
0.0PheXaa: 0.0 ± 0.0
Gly
9.748GlyAla: 9.748 ± 0.955
0.97GlyCys: 0.97 ± 0.195
5.002GlyAsp: 5.002 ± 0.56
5.257GlyGlu: 5.257 ± 0.504
3.164GlyPhe: 3.164 ± 0.392
8.166GlyGly: 8.166 ± 0.722
1.684GlyHis: 1.684 ± 0.341
3.573GlyIle: 3.573 ± 0.409
3.317GlyLys: 3.317 ± 0.547
6.482GlyLeu: 6.482 ± 0.546
2.501GlyMet: 2.501 ± 0.392
2.348GlyAsn: 2.348 ± 0.46
3.419GlyPro: 3.419 ± 0.455
3.062GlyGln: 3.062 ± 0.414
6.482GlyArg: 6.482 ± 0.688
4.593GlySer: 4.593 ± 0.443
5.92GlyThr: 5.92 ± 0.711
6.533GlyVal: 6.533 ± 0.698
1.684GlyTrp: 1.684 ± 0.319
2.654GlyTyr: 2.654 ± 0.396
0.0GlyXaa: 0.0 ± 0.0
His
1.786HisAla: 1.786 ± 0.34
0.153HisCys: 0.153 ± 0.112
1.072HisAsp: 1.072 ± 0.245
1.888HisGlu: 1.888 ± 0.317
0.51HisPhe: 0.51 ± 0.133
1.684HisGly: 1.684 ± 0.308
0.459HisHis: 0.459 ± 0.164
0.663HisIle: 0.663 ± 0.217
0.306HisLys: 0.306 ± 0.107
1.378HisLeu: 1.378 ± 0.252
0.255HisMet: 0.255 ± 0.111
0.459HisAsn: 0.459 ± 0.158
1.378HisPro: 1.378 ± 0.267
0.612HisGln: 0.612 ± 0.273
1.684HisArg: 1.684 ± 0.355
0.766HisSer: 0.766 ± 0.182
0.766HisThr: 0.766 ± 0.23
1.48HisVal: 1.48 ± 0.241
0.408HisTrp: 0.408 ± 0.119
0.663HisTyr: 0.663 ± 0.18
0.0HisXaa: 0.0 ± 0.0
Ile
6.431IleAla: 6.431 ± 0.629
0.408IleCys: 0.408 ± 0.171
3.317IleAsp: 3.317 ± 0.405
4.338IleGlu: 4.338 ± 0.427
1.123IlePhe: 1.123 ± 0.342
3.93IleGly: 3.93 ± 0.539
0.612IleHis: 0.612 ± 0.181
2.246IleIle: 2.246 ± 0.356
0.868IleLys: 0.868 ± 0.286
2.501IleLeu: 2.501 ± 0.418
1.225IleMet: 1.225 ± 0.213
1.378IleAsn: 1.378 ± 0.307
2.807IlePro: 2.807 ± 0.411
1.429IleGln: 1.429 ± 0.241
3.215IleArg: 3.215 ± 0.373
2.552IleSer: 2.552 ± 0.438
3.266IleThr: 3.266 ± 0.347
2.705IleVal: 2.705 ± 0.315
0.663IleTrp: 0.663 ± 0.189
0.97IleTyr: 0.97 ± 0.224
0.0IleXaa: 0.0 ± 0.0
Lys
4.083LysAla: 4.083 ± 0.468
0.204LysCys: 0.204 ± 0.146
1.429LysAsp: 1.429 ± 0.263
1.939LysGlu: 1.939 ± 0.321
0.612LysPhe: 0.612 ± 0.166
2.195LysGly: 2.195 ± 0.433
0.357LysHis: 0.357 ± 0.126
1.531LysIle: 1.531 ± 0.217
0.255LysLys: 0.255 ± 0.109
2.807LysLeu: 2.807 ± 0.425
0.766LysMet: 0.766 ± 0.203
0.715LysAsn: 0.715 ± 0.194
2.092LysPro: 2.092 ± 0.496
0.97LysGln: 0.97 ± 0.166
2.246LysArg: 2.246 ± 0.42
1.531LysSer: 1.531 ± 0.252
1.633LysThr: 1.633 ± 0.313
2.246LysVal: 2.246 ± 0.398
0.357LysTrp: 0.357 ± 0.145
0.561LysTyr: 0.561 ± 0.159
0.0LysXaa: 0.0 ± 0.0
Leu
10.309LeuAla: 10.309 ± 0.852
0.408LeuCys: 0.408 ± 0.135
6.073LeuAsp: 6.073 ± 0.789
5.92LeuGlu: 5.92 ± 0.535
1.786LeuPhe: 1.786 ± 0.32
6.175LeuGly: 6.175 ± 0.548
1.429LeuHis: 1.429 ± 0.308
4.032LeuIle: 4.032 ± 0.423
2.348LeuLys: 2.348 ± 0.386
5.512LeuLeu: 5.512 ± 0.534
1.276LeuMet: 1.276 ± 0.245
2.705LeuAsn: 2.705 ± 0.443
3.624LeuPro: 3.624 ± 0.402
1.786LeuGln: 1.786 ± 0.3
6.584LeuArg: 6.584 ± 0.639
5.461LeuSer: 5.461 ± 0.506
5.614LeuThr: 5.614 ± 0.453
5.308LeuVal: 5.308 ± 0.525
1.021LeuTrp: 1.021 ± 0.206
1.633LeuTyr: 1.633 ± 0.325
0.0LeuXaa: 0.0 ± 0.0
Met
2.807MetAla: 2.807 ± 0.384
0.153MetCys: 0.153 ± 0.076
1.327MetAsp: 1.327 ± 0.309
1.225MetGlu: 1.225 ± 0.273
0.153MetPhe: 0.153 ± 0.083
1.633MetGly: 1.633 ± 0.33
0.51MetHis: 0.51 ± 0.167
1.378MetIle: 1.378 ± 0.288
0.868MetLys: 0.868 ± 0.207
2.195MetLeu: 2.195 ± 0.32
0.817MetMet: 0.817 ± 0.218
0.715MetAsn: 0.715 ± 0.176
2.195MetPro: 2.195 ± 0.306
0.817MetGln: 0.817 ± 0.176
2.552MetArg: 2.552 ± 0.279
2.092MetSer: 2.092 ± 0.319
1.735MetThr: 1.735 ± 0.302
1.327MetVal: 1.327 ± 0.226
0.153MetTrp: 0.153 ± 0.083
0.255MetTyr: 0.255 ± 0.103
0.0MetXaa: 0.0 ± 0.0
Asn
3.521AsnAla: 3.521 ± 0.481
0.0AsnCys: 0.0 ± 0.0
1.633AsnAsp: 1.633 ± 0.291
1.837AsnGlu: 1.837 ± 0.288
0.715AsnPhe: 0.715 ± 0.192
3.215AsnGly: 3.215 ± 0.457
0.663AsnHis: 0.663 ± 0.16
0.97AsnIle: 0.97 ± 0.214
0.612AsnLys: 0.612 ± 0.159
2.909AsnLeu: 2.909 ± 0.38
0.612AsnMet: 0.612 ± 0.181
0.612AsnAsn: 0.612 ± 0.191
2.041AsnPro: 2.041 ± 0.32
0.715AsnGln: 0.715 ± 0.171
2.041AsnArg: 2.041 ± 0.344
1.735AsnSer: 1.735 ± 0.295
1.837AsnThr: 1.837 ± 0.265
1.582AsnVal: 1.582 ± 0.256
0.561AsnTrp: 0.561 ± 0.176
0.306AsnTyr: 0.306 ± 0.096
0.0AsnXaa: 0.0 ± 0.0
Pro
6.635ProAla: 6.635 ± 0.749
0.561ProCys: 0.561 ± 0.168
3.215ProAsp: 3.215 ± 0.413
4.797ProGlu: 4.797 ± 0.567
1.123ProPhe: 1.123 ± 0.26
5.41ProGly: 5.41 ± 0.653
0.766ProHis: 0.766 ± 0.152
1.939ProIle: 1.939 ± 0.308
1.531ProLys: 1.531 ± 0.34
3.726ProLeu: 3.726 ± 0.378
1.276ProMet: 1.276 ± 0.195
2.195ProAsn: 2.195 ± 0.394
3.062ProPro: 3.062 ± 0.487
2.654ProGln: 2.654 ± 0.842
3.521ProArg: 3.521 ± 0.376
3.726ProSer: 3.726 ± 0.466
3.93ProThr: 3.93 ± 0.386
5.206ProVal: 5.206 ± 0.623
1.072ProTrp: 1.072 ± 0.199
1.633ProTyr: 1.633 ± 0.29
0.0ProXaa: 0.0 ± 0.0
Gln
3.266GlnAla: 3.266 ± 0.569
0.204GlnCys: 0.204 ± 0.083
1.327GlnAsp: 1.327 ± 0.241
2.45GlnGlu: 2.45 ± 0.254
0.97GlnPhe: 0.97 ± 0.291
3.317GlnGly: 3.317 ± 0.775
0.919GlnHis: 0.919 ± 0.235
1.531GlnIle: 1.531 ± 0.315
0.868GlnLys: 0.868 ± 0.235
2.96GlnLeu: 2.96 ± 0.397
1.327GlnMet: 1.327 ± 0.268
0.663GlnAsn: 0.663 ± 0.194
2.552GlnPro: 2.552 ± 0.63
3.113GlnGln: 3.113 ± 1.539
3.368GlnArg: 3.368 ± 0.412
2.144GlnSer: 2.144 ± 0.342
1.786GlnThr: 1.786 ± 0.315
2.45GlnVal: 2.45 ± 0.345
0.663GlnTrp: 0.663 ± 0.198
1.072GlnTyr: 1.072 ± 0.234
0.0GlnXaa: 0.0 ± 0.0
Arg
9.186ArgAla: 9.186 ± 0.623
1.276ArgCys: 1.276 ± 0.277
3.879ArgAsp: 3.879 ± 0.634
5.512ArgGlu: 5.512 ± 0.773
2.552ArgPhe: 2.552 ± 0.364
5.461ArgGly: 5.461 ± 0.645
1.48ArgHis: 1.48 ± 0.369
3.675ArgIle: 3.675 ± 0.45
2.348ArgLys: 2.348 ± 0.462
5.359ArgLeu: 5.359 ± 0.565
2.092ArgMet: 2.092 ± 0.327
2.399ArgAsn: 2.399 ± 0.365
3.981ArgPro: 3.981 ± 0.489
2.45ArgGln: 2.45 ± 0.325
6.533ArgArg: 6.533 ± 0.755
4.338ArgSer: 4.338 ± 0.45
3.368ArgThr: 3.368 ± 0.396
5.41ArgVal: 5.41 ± 0.498
1.531ArgTrp: 1.531 ± 0.283
1.633ArgTyr: 1.633 ± 0.242
0.0ArgXaa: 0.0 ± 0.0
Ser
5.92SerAla: 5.92 ± 0.668
0.51SerCys: 0.51 ± 0.142
3.675SerAsp: 3.675 ± 0.339
4.083SerGlu: 4.083 ± 0.508
1.888SerPhe: 1.888 ± 0.345
5.206SerGly: 5.206 ± 0.561
0.817SerHis: 0.817 ± 0.258
2.807SerIle: 2.807 ± 0.334
1.531SerLys: 1.531 ± 0.276
3.93SerLeu: 3.93 ± 0.337
1.99SerMet: 1.99 ± 0.31
1.174SerAsn: 1.174 ± 0.263
2.45SerPro: 2.45 ± 0.394
1.786SerGln: 1.786 ± 0.346
3.215SerArg: 3.215 ± 0.421
3.164SerSer: 3.164 ± 0.441
4.593SerThr: 4.593 ± 0.553
4.134SerVal: 4.134 ± 0.439
1.021SerTrp: 1.021 ± 0.288
1.99SerTyr: 1.99 ± 0.367
0.0SerXaa: 0.0 ± 0.0
Thr
6.941ThrAla: 6.941 ± 0.666
0.306ThrCys: 0.306 ± 0.134
3.726ThrAsp: 3.726 ± 0.413
3.726ThrGlu: 3.726 ± 0.466
2.092ThrPhe: 2.092 ± 0.322
5.614ThrGly: 5.614 ± 0.656
1.072ThrHis: 1.072 ± 0.233
2.807ThrIle: 2.807 ± 0.345
1.888ThrLys: 1.888 ± 0.29
5.461ThrLeu: 5.461 ± 0.455
1.327ThrMet: 1.327 ± 0.216
1.786ThrAsn: 1.786 ± 0.35
5.614ThrPro: 5.614 ± 0.608
1.786ThrGln: 1.786 ± 0.265
4.032ThrArg: 4.032 ± 0.528
3.317ThrSer: 3.317 ± 0.366
3.879ThrThr: 3.879 ± 0.525
4.695ThrVal: 4.695 ± 0.458
1.378ThrTrp: 1.378 ± 0.214
1.174ThrTyr: 1.174 ± 0.252
0.0ThrXaa: 0.0 ± 0.0
Val
7.86ValAla: 7.86 ± 0.542
0.561ValCys: 0.561 ± 0.18
4.95ValAsp: 4.95 ± 0.505
4.848ValGlu: 4.848 ± 0.506
1.939ValPhe: 1.939 ± 0.31
5.818ValGly: 5.818 ± 0.503
1.684ValHis: 1.684 ± 0.267
3.521ValIle: 3.521 ± 0.352
1.99ValLys: 1.99 ± 0.357
5.053ValLeu: 5.053 ± 0.506
2.246ValMet: 2.246 ± 0.365
1.99ValAsn: 1.99 ± 0.288
3.368ValPro: 3.368 ± 0.427
2.705ValGln: 2.705 ± 0.393
4.695ValArg: 4.695 ± 0.519
3.93ValSer: 3.93 ± 0.442
6.226ValThr: 6.226 ± 0.539
5.41ValVal: 5.41 ± 0.595
1.633ValTrp: 1.633 ± 0.345
1.939ValTyr: 1.939 ± 0.372
0.0ValXaa: 0.0 ± 0.0
Trp
2.45TrpAla: 2.45 ± 0.374
0.153TrpCys: 0.153 ± 0.092
0.97TrpAsp: 0.97 ± 0.197
1.327TrpGlu: 1.327 ± 0.338
0.868TrpPhe: 0.868 ± 0.39
0.868TrpGly: 0.868 ± 0.248
0.306TrpHis: 0.306 ± 0.124
1.174TrpIle: 1.174 ± 0.255
0.561TrpLys: 0.561 ± 0.169
1.429TrpLeu: 1.429 ± 0.246
0.255TrpMet: 0.255 ± 0.102
0.51TrpAsn: 0.51 ± 0.123
0.612TrpPro: 0.612 ± 0.15
0.663TrpGln: 0.663 ± 0.182
1.429TrpArg: 1.429 ± 0.241
1.531TrpSer: 1.531 ± 0.254
1.225TrpThr: 1.225 ± 0.283
1.684TrpVal: 1.684 ± 0.32
0.715TrpTrp: 0.715 ± 0.299
0.561TrpTyr: 0.561 ± 0.162
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.062TyrAla: 3.062 ± 0.35
0.306TyrCys: 0.306 ± 0.129
1.531TyrAsp: 1.531 ± 0.227
2.195TyrGlu: 2.195 ± 0.378
0.97TyrPhe: 0.97 ± 0.217
2.654TyrGly: 2.654 ± 0.436
0.408TyrHis: 0.408 ± 0.155
0.663TyrIle: 0.663 ± 0.153
0.357TyrLys: 0.357 ± 0.119
2.195TyrLeu: 2.195 ± 0.327
0.255TyrMet: 0.255 ± 0.094
0.561TyrAsn: 0.561 ± 0.187
1.123TyrPro: 1.123 ± 0.263
0.817TyrGln: 0.817 ± 0.178
2.144TyrArg: 2.144 ± 0.393
1.123TyrSer: 1.123 ± 0.173
1.429TyrThr: 1.429 ± 0.197
1.99TyrVal: 1.99 ± 0.32
0.561TyrTrp: 0.561 ± 0.212
0.408TyrTyr: 0.408 ± 0.144
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 102 proteins (19595 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski