Amino acid dipepetide frequency for Microbacterium phage Hubbs

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.109AlaAla: 12.109 ± 1.089
0.577AlaCys: 0.577 ± 0.185
5.295AlaAsp: 5.295 ± 0.558
7.863AlaGlu: 7.863 ± 0.698
3.198AlaPhe: 3.198 ± 0.418
6.658AlaGly: 6.658 ± 0.996
1.835AlaHis: 1.835 ± 0.324
5.347AlaIle: 5.347 ± 0.509
4.666AlaLys: 4.666 ± 0.609
9.121AlaLeu: 9.121 ± 0.742
3.303AlaMet: 3.303 ± 0.409
4.089AlaAsn: 4.089 ± 0.484
3.407AlaPro: 3.407 ± 0.327
3.932AlaGln: 3.932 ± 0.453
6.5AlaArg: 6.5 ± 0.666
6.186AlaSer: 6.186 ± 0.762
5.819AlaThr: 5.819 ± 0.637
6.553AlaVal: 6.553 ± 0.572
2.044AlaTrp: 2.044 ± 0.288
2.307AlaTyr: 2.307 ± 0.401
0.0AlaXaa: 0.0 ± 0.0
Cys
0.262CysAla: 0.262 ± 0.102
0.052CysCys: 0.052 ± 0.05
0.367CysAsp: 0.367 ± 0.138
0.472CysGlu: 0.472 ± 0.206
0.262CysPhe: 0.262 ± 0.102
0.681CysGly: 0.681 ± 0.233
0.105CysHis: 0.105 ± 0.074
0.262CysIle: 0.262 ± 0.12
0.262CysLys: 0.262 ± 0.125
0.105CysLeu: 0.105 ± 0.066
0.052CysMet: 0.052 ± 0.056
0.157CysAsn: 0.157 ± 0.076
0.262CysPro: 0.262 ± 0.15
0.315CysGln: 0.315 ± 0.122
0.472CysArg: 0.472 ± 0.207
0.157CysSer: 0.157 ± 0.09
0.052CysThr: 0.052 ± 0.056
0.315CysVal: 0.315 ± 0.129
0.105CysTrp: 0.105 ± 0.07
0.105CysTyr: 0.105 ± 0.079
0.0CysXaa: 0.0 ± 0.0
Asp
6.867AspAla: 6.867 ± 0.745
0.524AspCys: 0.524 ± 0.185
4.77AspAsp: 4.77 ± 0.445
5.137AspGlu: 5.137 ± 0.47
2.097AspPhe: 2.097 ± 0.33
5.819AspGly: 5.819 ± 0.534
0.996AspHis: 0.996 ± 0.226
3.145AspIle: 3.145 ± 0.508
2.464AspLys: 2.464 ± 0.458
4.613AspLeu: 4.613 ± 0.564
1.678AspMet: 1.678 ± 0.296
1.94AspAsn: 1.94 ± 0.338
3.565AspPro: 3.565 ± 0.495
2.097AspGln: 2.097 ± 0.335
3.407AspArg: 3.407 ± 0.403
3.722AspSer: 3.722 ± 0.482
3.827AspThr: 3.827 ± 0.556
4.351AspVal: 4.351 ± 0.426
1.678AspTrp: 1.678 ± 0.24
2.726AspTyr: 2.726 ± 0.363
0.0AspXaa: 0.0 ± 0.0
Glu
6.553GluAla: 6.553 ± 0.596
0.419GluCys: 0.419 ± 0.162
4.613GluAsp: 4.613 ± 0.473
5.347GluGlu: 5.347 ± 0.609
2.569GluPhe: 2.569 ± 0.35
4.823GluGly: 4.823 ± 0.46
1.468GluHis: 1.468 ± 0.265
4.718GluIle: 4.718 ± 0.43
4.456GluLys: 4.456 ± 0.517
4.823GluLeu: 4.823 ± 0.492
2.044GluMet: 2.044 ± 0.364
2.778GluAsn: 2.778 ± 0.366
2.359GluPro: 2.359 ± 0.35
3.879GluGln: 3.879 ± 0.505
5.452GluArg: 5.452 ± 0.584
3.617GluSer: 3.617 ± 0.46
4.351GluThr: 4.351 ± 0.499
6.238GluVal: 6.238 ± 0.595
1.678GluTrp: 1.678 ± 0.263
3.145GluTyr: 3.145 ± 0.475
0.0GluXaa: 0.0 ± 0.0
Phe
3.145PheAla: 3.145 ± 0.365
0.157PheCys: 0.157 ± 0.09
2.831PheAsp: 2.831 ± 0.406
2.674PheGlu: 2.674 ± 0.407
0.577PhePhe: 0.577 ± 0.143
2.411PheGly: 2.411 ± 0.389
0.681PheHis: 0.681 ± 0.169
1.468PheIle: 1.468 ± 0.264
1.206PheLys: 1.206 ± 0.228
2.254PheLeu: 2.254 ± 0.396
1.258PheMet: 1.258 ± 0.26
1.311PheAsn: 1.311 ± 0.351
1.153PhePro: 1.153 ± 0.233
1.73PheGln: 1.73 ± 0.284
2.149PheArg: 2.149 ± 0.334
1.678PheSer: 1.678 ± 0.322
2.831PheThr: 2.831 ± 0.379
2.359PheVal: 2.359 ± 0.301
0.472PheTrp: 0.472 ± 0.154
1.311PheTyr: 1.311 ± 0.26
0.0PheXaa: 0.0 ± 0.0
Gly
6.448GlyAla: 6.448 ± 0.699
0.315GlyCys: 0.315 ± 0.109
5.609GlyAsp: 5.609 ± 0.523
6.553GlyGlu: 6.553 ± 0.599
3.722GlyPhe: 3.722 ± 0.549
6.605GlyGly: 6.605 ± 0.8
1.835GlyHis: 1.835 ± 0.308
4.403GlyIle: 4.403 ± 0.603
2.988GlyLys: 2.988 ± 0.504
6.029GlyLeu: 6.029 ± 0.749
1.992GlyMet: 1.992 ± 0.334
2.778GlyAsn: 2.778 ± 0.44
2.044GlyPro: 2.044 ± 0.312
2.778GlyGln: 2.778 ± 0.393
5.871GlyArg: 5.871 ± 0.514
4.613GlySer: 4.613 ± 0.496
5.504GlyThr: 5.504 ± 0.686
5.557GlyVal: 5.557 ± 0.564
2.149GlyTrp: 2.149 ± 0.41
2.883GlyTyr: 2.883 ± 0.359
0.0GlyXaa: 0.0 ± 0.0
His
1.625HisAla: 1.625 ± 0.295
0.052HisCys: 0.052 ± 0.055
0.944HisAsp: 0.944 ± 0.213
1.415HisGlu: 1.415 ± 0.302
0.472HisPhe: 0.472 ± 0.133
1.415HisGly: 1.415 ± 0.376
0.157HisHis: 0.157 ± 0.091
0.944HisIle: 0.944 ± 0.215
0.786HisLys: 0.786 ± 0.182
1.363HisLeu: 1.363 ± 0.327
0.472HisMet: 0.472 ± 0.146
0.734HisAsn: 0.734 ± 0.218
1.258HisPro: 1.258 ± 0.248
0.524HisGln: 0.524 ± 0.15
0.996HisArg: 0.996 ± 0.266
1.101HisSer: 1.101 ± 0.219
1.258HisThr: 1.258 ± 0.247
1.101HisVal: 1.101 ± 0.196
0.21HisTrp: 0.21 ± 0.098
0.681HisTyr: 0.681 ± 0.194
0.0HisXaa: 0.0 ± 0.0
Ile
4.98IleAla: 4.98 ± 0.532
0.105IleCys: 0.105 ± 0.076
3.984IleAsp: 3.984 ± 0.345
5.19IleGlu: 5.19 ± 0.515
1.678IlePhe: 1.678 ± 0.233
3.827IleGly: 3.827 ± 0.478
1.101IleHis: 1.101 ± 0.234
2.254IleIle: 2.254 ± 0.328
2.569IleLys: 2.569 ± 0.398
3.932IleLeu: 3.932 ± 0.442
0.786IleMet: 0.786 ± 0.197
2.202IleAsn: 2.202 ± 0.425
2.359IlePro: 2.359 ± 0.332
2.464IleGln: 2.464 ± 0.37
4.246IleArg: 4.246 ± 0.425
2.936IleSer: 2.936 ± 0.417
3.25IleThr: 3.25 ± 0.334
3.303IleVal: 3.303 ± 0.385
0.786IleTrp: 0.786 ± 0.192
1.258IleTyr: 1.258 ± 0.229
0.0IleXaa: 0.0 ± 0.0
Lys
5.295LysAla: 5.295 ± 0.662
0.262LysCys: 0.262 ± 0.151
2.988LysAsp: 2.988 ± 0.506
2.569LysGlu: 2.569 ± 0.364
1.835LysPhe: 1.835 ± 0.309
3.303LysGly: 3.303 ± 0.465
0.891LysHis: 0.891 ± 0.195
3.04LysIle: 3.04 ± 0.413
3.145LysLys: 3.145 ± 0.426
3.879LysLeu: 3.879 ± 0.456
1.153LysMet: 1.153 ± 0.279
1.101LysAsn: 1.101 ± 0.283
1.573LysPro: 1.573 ± 0.349
1.468LysGln: 1.468 ± 0.306
3.355LysArg: 3.355 ± 0.521
2.569LysSer: 2.569 ± 0.407
2.359LysThr: 2.359 ± 0.403
3.565LysVal: 3.565 ± 0.415
0.577LysTrp: 0.577 ± 0.172
1.101LysTyr: 1.101 ± 0.229
0.0LysXaa: 0.0 ± 0.0
Leu
7.496LeuAla: 7.496 ± 0.695
0.262LeuCys: 0.262 ± 0.126
5.504LeuAsp: 5.504 ± 0.579
4.823LeuGlu: 4.823 ± 0.523
2.202LeuPhe: 2.202 ± 0.347
6.71LeuGly: 6.71 ± 0.624
1.048LeuHis: 1.048 ± 0.251
3.303LeuIle: 3.303 ± 0.41
3.303LeuLys: 3.303 ± 0.375
3.512LeuLeu: 3.512 ± 0.394
1.206LeuMet: 1.206 ± 0.218
2.516LeuAsn: 2.516 ± 0.438
4.613LeuPro: 4.613 ± 0.5
2.411LeuGln: 2.411 ± 0.435
4.613LeuArg: 4.613 ± 0.539
5.137LeuSer: 5.137 ± 0.47
4.246LeuThr: 4.246 ± 0.466
4.403LeuVal: 4.403 ± 0.423
1.258LeuTrp: 1.258 ± 0.262
1.835LeuTyr: 1.835 ± 0.279
0.0LeuXaa: 0.0 ± 0.0
Met
3.46MetAla: 3.46 ± 0.398
0.0MetCys: 0.0 ± 0.0
1.311MetAsp: 1.311 ± 0.279
1.415MetGlu: 1.415 ± 0.24
0.786MetPhe: 0.786 ± 0.254
1.678MetGly: 1.678 ± 0.292
0.629MetHis: 0.629 ± 0.201
0.891MetIle: 0.891 ± 0.25
1.153MetLys: 1.153 ± 0.239
0.734MetLeu: 0.734 ± 0.176
0.472MetMet: 0.472 ± 0.171
0.996MetAsn: 0.996 ± 0.221
1.101MetPro: 1.101 ± 0.251
0.734MetGln: 0.734 ± 0.198
1.468MetArg: 1.468 ± 0.262
2.621MetSer: 2.621 ± 0.39
2.149MetThr: 2.149 ± 0.312
1.363MetVal: 1.363 ± 0.287
0.262MetTrp: 0.262 ± 0.128
0.367MetTyr: 0.367 ± 0.14
0.0MetXaa: 0.0 ± 0.0
Asn
3.407AsnAla: 3.407 ± 0.408
0.052AsnCys: 0.052 ± 0.058
1.835AsnAsp: 1.835 ± 0.367
2.149AsnGlu: 2.149 ± 0.305
1.415AsnPhe: 1.415 ± 0.246
4.666AsnGly: 4.666 ± 0.508
0.786AsnHis: 0.786 ± 0.201
1.468AsnIle: 1.468 ± 0.313
1.258AsnLys: 1.258 ± 0.292
3.198AsnLeu: 3.198 ± 0.349
0.786AsnMet: 0.786 ± 0.188
1.73AsnAsn: 1.73 ± 0.315
2.254AsnPro: 2.254 ± 0.285
1.048AsnGln: 1.048 ± 0.329
2.464AsnArg: 2.464 ± 0.362
2.936AsnSer: 2.936 ± 0.477
1.887AsnThr: 1.887 ± 0.314
2.674AsnVal: 2.674 ± 0.315
0.629AsnTrp: 0.629 ± 0.169
1.311AsnTyr: 1.311 ± 0.306
0.0AsnXaa: 0.0 ± 0.0
Pro
3.879ProAla: 3.879 ± 0.439
0.262ProCys: 0.262 ± 0.112
3.565ProAsp: 3.565 ± 0.426
4.77ProGlu: 4.77 ± 0.547
1.52ProPhe: 1.52 ± 0.308
3.932ProGly: 3.932 ± 0.459
0.681ProHis: 0.681 ± 0.166
2.202ProIle: 2.202 ± 0.377
2.097ProLys: 2.097 ± 0.383
2.621ProLeu: 2.621 ± 0.356
0.891ProMet: 0.891 ± 0.213
1.573ProAsn: 1.573 ± 0.303
1.678ProPro: 1.678 ± 0.317
1.782ProGln: 1.782 ± 0.349
1.625ProArg: 1.625 ± 0.263
2.254ProSer: 2.254 ± 0.33
2.883ProThr: 2.883 ± 0.411
3.512ProVal: 3.512 ± 0.501
0.786ProTrp: 0.786 ± 0.223
1.101ProTyr: 1.101 ± 0.205
0.0ProXaa: 0.0 ± 0.0
Gln
3.984GlnAla: 3.984 ± 0.48
0.21GlnCys: 0.21 ± 0.106
1.94GlnAsp: 1.94 ± 0.363
2.516GlnGlu: 2.516 ± 0.371
1.311GlnPhe: 1.311 ± 0.248
2.831GlnGly: 2.831 ± 0.356
0.419GlnHis: 0.419 ± 0.192
2.464GlnIle: 2.464 ± 0.355
1.625GlnLys: 1.625 ± 0.274
2.569GlnLeu: 2.569 ± 0.405
0.839GlnMet: 0.839 ± 0.202
1.206GlnAsn: 1.206 ± 0.222
1.258GlnPro: 1.258 ± 0.245
0.996GlnGln: 0.996 ± 0.209
1.887GlnArg: 1.887 ± 0.398
2.202GlnSer: 2.202 ± 0.345
1.992GlnThr: 1.992 ± 0.305
3.827GlnVal: 3.827 ± 0.439
0.786GlnTrp: 0.786 ± 0.218
0.891GlnTyr: 0.891 ± 0.197
0.0GlnXaa: 0.0 ± 0.0
Arg
6.029ArgAla: 6.029 ± 0.555
0.315ArgCys: 0.315 ± 0.136
4.141ArgAsp: 4.141 ± 0.45
5.347ArgGlu: 5.347 ± 0.595
2.097ArgPhe: 2.097 ± 0.338
4.666ArgGly: 4.666 ± 0.476
1.258ArgHis: 1.258 ± 0.243
4.194ArgIle: 4.194 ± 0.454
3.355ArgLys: 3.355 ± 0.484
4.77ArgLeu: 4.77 ± 0.468
1.887ArgMet: 1.887 ± 0.391
2.674ArgAsn: 2.674 ± 0.415
2.883ArgPro: 2.883 ± 0.342
1.992ArgGln: 1.992 ± 0.351
4.928ArgArg: 4.928 ± 0.467
3.407ArgSer: 3.407 ± 0.416
3.04ArgThr: 3.04 ± 0.455
4.823ArgVal: 4.823 ± 0.516
1.311ArgTrp: 1.311 ± 0.346
2.621ArgTyr: 2.621 ± 0.423
0.0ArgXaa: 0.0 ± 0.0
Ser
6.343SerAla: 6.343 ± 0.732
0.262SerCys: 0.262 ± 0.142
3.25SerAsp: 3.25 ± 0.432
3.774SerGlu: 3.774 ± 0.474
2.097SerPhe: 2.097 ± 0.321
5.609SerGly: 5.609 ± 0.548
0.944SerHis: 0.944 ± 0.226
3.25SerIle: 3.25 ± 0.427
3.145SerLys: 3.145 ± 0.404
4.77SerLeu: 4.77 ± 0.635
1.573SerMet: 1.573 ± 0.306
2.359SerAsn: 2.359 ± 0.5
2.464SerPro: 2.464 ± 0.448
1.678SerGln: 1.678 ± 0.276
3.617SerArg: 3.617 ± 0.31
4.036SerSer: 4.036 ± 0.655
3.774SerThr: 3.774 ± 0.498
3.722SerVal: 3.722 ± 0.468
1.52SerTrp: 1.52 ± 0.271
2.359SerTyr: 2.359 ± 0.363
0.0SerXaa: 0.0 ± 0.0
Thr
5.557ThrAla: 5.557 ± 0.607
0.315ThrCys: 0.315 ± 0.148
3.879ThrAsp: 3.879 ± 0.516
4.036ThrGlu: 4.036 ± 0.451
2.621ThrPhe: 2.621 ± 0.34
5.19ThrGly: 5.19 ± 0.552
0.891ThrHis: 0.891 ± 0.281
3.722ThrIle: 3.722 ± 0.471
2.307ThrLys: 2.307 ± 0.306
4.718ThrLeu: 4.718 ± 0.461
1.258ThrMet: 1.258 ± 0.239
2.569ThrAsn: 2.569 ± 0.386
3.722ThrPro: 3.722 ± 0.427
1.73ThrGln: 1.73 ± 0.25
3.46ThrArg: 3.46 ± 0.345
3.25ThrSer: 3.25 ± 0.472
4.089ThrThr: 4.089 ± 0.458
4.77ThrVal: 4.77 ± 0.594
1.206ThrTrp: 1.206 ± 0.25
1.94ThrTyr: 1.94 ± 0.366
0.0ThrXaa: 0.0 ± 0.0
Val
7.863ValAla: 7.863 ± 0.653
0.367ValCys: 0.367 ± 0.147
5.085ValAsp: 5.085 ± 0.556
5.819ValGlu: 5.819 ± 0.632
1.73ValPhe: 1.73 ± 0.293
5.504ValGly: 5.504 ± 0.472
1.048ValHis: 1.048 ± 0.195
3.774ValIle: 3.774 ± 0.49
3.827ValLys: 3.827 ± 0.444
4.613ValLeu: 4.613 ± 0.467
0.786ValMet: 0.786 ± 0.168
2.831ValAsn: 2.831 ± 0.423
3.303ValPro: 3.303 ± 0.516
2.254ValGln: 2.254 ± 0.343
5.452ValArg: 5.452 ± 0.486
4.299ValSer: 4.299 ± 0.457
5.033ValThr: 5.033 ± 0.561
5.399ValVal: 5.399 ± 0.567
1.258ValTrp: 1.258 ± 0.261
2.149ValTyr: 2.149 ± 0.378
0.0ValXaa: 0.0 ± 0.0
Trp
1.782TrpAla: 1.782 ± 0.341
0.052TrpCys: 0.052 ± 0.054
1.625TrpAsp: 1.625 ± 0.304
1.258TrpGlu: 1.258 ± 0.277
0.577TrpPhe: 0.577 ± 0.211
1.992TrpGly: 1.992 ± 0.257
0.157TrpHis: 0.157 ± 0.101
0.944TrpIle: 0.944 ± 0.212
0.367TrpLys: 0.367 ± 0.137
0.996TrpLeu: 0.996 ± 0.171
0.472TrpMet: 0.472 ± 0.159
1.048TrpAsn: 1.048 ± 0.2
0.786TrpPro: 0.786 ± 0.221
0.681TrpGln: 0.681 ± 0.217
1.992TrpArg: 1.992 ± 0.433
1.311TrpSer: 1.311 ± 0.319
0.786TrpThr: 0.786 ± 0.213
1.73TrpVal: 1.73 ± 0.309
0.524TrpTrp: 0.524 ± 0.161
0.629TrpTyr: 0.629 ± 0.186
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.722TyrAla: 3.722 ± 0.435
0.262TyrCys: 0.262 ± 0.115
1.94TyrAsp: 1.94 ± 0.386
2.202TyrGlu: 2.202 ± 0.32
0.891TyrPhe: 0.891 ± 0.236
2.359TyrGly: 2.359 ± 0.318
0.577TyrHis: 0.577 ± 0.168
1.415TyrIle: 1.415 ± 0.259
1.048TyrLys: 1.048 ± 0.217
1.782TyrLeu: 1.782 ± 0.36
0.629TyrMet: 0.629 ± 0.159
1.363TyrAsn: 1.363 ± 0.227
1.573TyrPro: 1.573 ± 0.248
1.258TyrGln: 1.258 ± 0.253
1.782TyrArg: 1.782 ± 0.314
2.516TyrSer: 2.516 ± 0.426
2.097TyrThr: 2.097 ± 0.329
2.831TyrVal: 2.831 ± 0.382
0.472TyrTrp: 0.472 ± 0.152
0.891TyrTyr: 0.891 ± 0.236
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 114 proteins (19077 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski