Amino acid dipepetide frequency for Streptomyces phage Hank144

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.283AlaAla: 11.283 ± 0.996
0.838AlaCys: 0.838 ± 0.275
7.93AlaAsp: 7.93 ± 0.832
9.026AlaGlu: 9.026 ± 0.842
2.837AlaPhe: 2.837 ± 0.447
8.446AlaGly: 8.446 ± 0.987
2.063AlaHis: 2.063 ± 0.377
5.545AlaIle: 5.545 ± 0.633
5.416AlaLys: 5.416 ± 0.597
11.154AlaLeu: 11.154 ± 1.03
3.03AlaMet: 3.03 ± 0.38
2.643AlaAsn: 2.643 ± 0.591
4.642AlaPro: 4.642 ± 0.491
2.966AlaGln: 2.966 ± 0.456
5.738AlaArg: 5.738 ± 0.67
5.351AlaSer: 5.351 ± 0.646
7.157AlaThr: 7.157 ± 0.712
7.544AlaVal: 7.544 ± 0.735
2.579AlaTrp: 2.579 ± 0.369
3.03AlaTyr: 3.03 ± 0.496
0.0AlaXaa: 0.0 ± 0.0
Cys
0.645CysAla: 0.645 ± 0.193
0.0CysCys: 0.0 ± 0.0
0.516CysAsp: 0.516 ± 0.18
0.516CysGlu: 0.516 ± 0.19
0.322CysPhe: 0.322 ± 0.124
1.032CysGly: 1.032 ± 0.309
0.387CysHis: 0.387 ± 0.133
0.258CysIle: 0.258 ± 0.122
0.645CysLys: 0.645 ± 0.181
0.387CysLeu: 0.387 ± 0.16
0.0CysMet: 0.0 ± 0.0
0.322CysAsn: 0.322 ± 0.144
0.645CysPro: 0.645 ± 0.27
0.064CysGln: 0.064 ± 0.056
0.709CysArg: 0.709 ± 0.28
0.838CysSer: 0.838 ± 0.221
0.322CysThr: 0.322 ± 0.134
0.258CysVal: 0.258 ± 0.161
0.129CysTrp: 0.129 ± 0.091
0.322CysTyr: 0.322 ± 0.161
0.0CysXaa: 0.0 ± 0.0
Asp
7.157AspAla: 7.157 ± 0.82
0.58AspCys: 0.58 ± 0.198
4.9AspAsp: 4.9 ± 0.714
5.803AspGlu: 5.803 ± 0.81
1.87AspPhe: 1.87 ± 0.286
6.254AspGly: 6.254 ± 0.632
1.096AspHis: 1.096 ± 0.295
3.159AspIle: 3.159 ± 0.389
2.45AspLys: 2.45 ± 0.427
5.222AspLeu: 5.222 ± 0.716
1.676AspMet: 1.676 ± 0.292
1.418AspAsn: 1.418 ± 0.234
3.997AspPro: 3.997 ± 0.527
1.999AspGln: 1.999 ± 0.246
3.74AspArg: 3.74 ± 0.486
3.546AspSer: 3.546 ± 0.513
3.288AspThr: 3.288 ± 0.5
4.255AspVal: 4.255 ± 0.481
1.418AspTrp: 1.418 ± 0.285
1.676AspTyr: 1.676 ± 0.362
0.0AspXaa: 0.0 ± 0.0
Glu
8.769GluAla: 8.769 ± 0.824
0.58GluCys: 0.58 ± 0.181
4.32GluAsp: 4.32 ± 0.654
6.19GluGlu: 6.19 ± 0.924
1.805GluPhe: 1.805 ± 0.308
6.512GluGly: 6.512 ± 0.557
2.128GluHis: 2.128 ± 0.417
4.32GluIle: 4.32 ± 0.558
2.128GluLys: 2.128 ± 0.309
7.415GluLeu: 7.415 ± 0.825
1.096GluMet: 1.096 ± 0.24
1.289GluAsn: 1.289 ± 0.278
3.417GluPro: 3.417 ± 0.487
2.45GluGln: 2.45 ± 0.403
5.609GluArg: 5.609 ± 0.684
3.675GluSer: 3.675 ± 0.493
4.578GluThr: 4.578 ± 0.754
4.707GluVal: 4.707 ± 0.602
1.418GluTrp: 1.418 ± 0.282
2.45GluTyr: 2.45 ± 0.488
0.0GluXaa: 0.0 ± 0.0
Phe
3.353PheAla: 3.353 ± 0.591
0.322PheCys: 0.322 ± 0.125
2.128PheAsp: 2.128 ± 0.354
2.45PheGlu: 2.45 ± 0.387
0.838PhePhe: 0.838 ± 0.252
3.224PheGly: 3.224 ± 0.348
0.258PheHis: 0.258 ± 0.126
1.096PheIle: 1.096 ± 0.311
0.774PheLys: 0.774 ± 0.232
1.612PheLeu: 1.612 ± 0.359
0.709PheMet: 0.709 ± 0.335
0.709PheAsn: 0.709 ± 0.207
1.289PhePro: 1.289 ± 0.304
1.161PheGln: 1.161 ± 0.234
1.934PheArg: 1.934 ± 0.302
1.676PheSer: 1.676 ± 0.281
1.999PheThr: 1.999 ± 0.378
2.063PheVal: 2.063 ± 0.422
0.645PheTrp: 0.645 ± 0.172
0.645PheTyr: 0.645 ± 0.231
0.0PheXaa: 0.0 ± 0.0
Gly
7.221GlyAla: 7.221 ± 1.025
0.322GlyCys: 0.322 ± 0.175
4.707GlyAsp: 4.707 ± 0.559
6.061GlyGlu: 6.061 ± 0.613
2.772GlyPhe: 2.772 ± 0.458
6.899GlyGly: 6.899 ± 1.118
1.999GlyHis: 1.999 ± 0.351
3.868GlyIle: 3.868 ± 0.867
4.707GlyLys: 4.707 ± 0.647
6.254GlyLeu: 6.254 ± 0.642
1.354GlyMet: 1.354 ± 0.232
3.095GlyAsn: 3.095 ± 0.399
4.32GlyPro: 4.32 ± 0.519
2.708GlyGln: 2.708 ± 0.386
5.674GlyArg: 5.674 ± 0.675
5.48GlySer: 5.48 ± 0.943
5.48GlyThr: 5.48 ± 0.856
5.996GlyVal: 5.996 ± 0.632
2.192GlyTrp: 2.192 ± 0.452
2.643GlyTyr: 2.643 ± 0.463
0.0GlyXaa: 0.0 ± 0.0
His
1.612HisAla: 1.612 ± 0.333
0.258HisCys: 0.258 ± 0.128
1.354HisAsp: 1.354 ± 0.253
1.741HisGlu: 1.741 ± 0.364
0.838HisPhe: 0.838 ± 0.304
1.805HisGly: 1.805 ± 0.358
0.903HisHis: 0.903 ± 0.24
0.838HisIle: 0.838 ± 0.205
0.516HisLys: 0.516 ± 0.194
1.805HisLeu: 1.805 ± 0.383
0.258HisMet: 0.258 ± 0.122
0.645HisAsn: 0.645 ± 0.211
0.903HisPro: 0.903 ± 0.276
0.516HisGln: 0.516 ± 0.156
1.354HisArg: 1.354 ± 0.397
1.096HisSer: 1.096 ± 0.433
1.289HisThr: 1.289 ± 0.262
1.032HisVal: 1.032 ± 0.254
0.709HisTrp: 0.709 ± 0.271
0.709HisTyr: 0.709 ± 0.256
0.0HisXaa: 0.0 ± 0.0
Ile
4.836IleAla: 4.836 ± 0.568
0.193IleCys: 0.193 ± 0.107
3.933IleAsp: 3.933 ± 0.626
3.611IleGlu: 3.611 ± 0.495
1.032IlePhe: 1.032 ± 0.313
3.353IleGly: 3.353 ± 0.468
0.838IleHis: 0.838 ± 0.229
1.87IleIle: 1.87 ± 0.406
2.321IleLys: 2.321 ± 0.346
3.353IleLeu: 3.353 ± 0.693
0.387IleMet: 0.387 ± 0.167
1.354IleAsn: 1.354 ± 0.237
2.386IlePro: 2.386 ± 0.4
2.063IleGln: 2.063 ± 0.531
3.095IleArg: 3.095 ± 0.366
2.515IleSer: 2.515 ± 0.372
2.837IleThr: 2.837 ± 0.463
3.675IleVal: 3.675 ± 0.503
0.322IleTrp: 0.322 ± 0.157
1.418IleTyr: 1.418 ± 0.278
0.0IleXaa: 0.0 ± 0.0
Lys
4.9LysAla: 4.9 ± 0.543
0.58LysCys: 0.58 ± 0.186
2.708LysAsp: 2.708 ± 0.494
2.45LysGlu: 2.45 ± 0.413
1.096LysPhe: 1.096 ± 0.241
4.384LysGly: 4.384 ± 0.508
0.903LysHis: 0.903 ± 0.242
1.741LysIle: 1.741 ± 0.381
2.772LysLys: 2.772 ± 0.543
3.675LysLeu: 3.675 ± 0.669
0.645LysMet: 0.645 ± 0.165
0.903LysAsn: 0.903 ± 0.279
2.708LysPro: 2.708 ± 0.439
1.87LysGln: 1.87 ± 0.335
3.933LysArg: 3.933 ± 0.668
2.45LysSer: 2.45 ± 0.43
1.934LysThr: 1.934 ± 0.364
2.772LysVal: 2.772 ± 0.42
0.645LysTrp: 0.645 ± 0.246
1.741LysTyr: 1.741 ± 0.329
0.0LysXaa: 0.0 ± 0.0
Leu
11.283LeuAla: 11.283 ± 0.99
0.58LeuCys: 0.58 ± 0.21
5.351LeuAsp: 5.351 ± 0.533
4.9LeuGlu: 4.9 ± 0.748
1.741LeuPhe: 1.741 ± 0.354
6.576LeuGly: 6.576 ± 0.959
1.289LeuHis: 1.289 ± 0.297
4.255LeuIle: 4.255 ± 0.505
3.417LeuLys: 3.417 ± 0.561
6.899LeuLeu: 6.899 ± 0.809
1.612LeuMet: 1.612 ± 0.421
3.095LeuAsn: 3.095 ± 0.419
4.513LeuPro: 4.513 ± 0.56
2.386LeuGln: 2.386 ± 0.407
6.061LeuArg: 6.061 ± 0.862
4.9LeuSer: 4.9 ± 0.505
4.965LeuThr: 4.965 ± 0.518
6.383LeuVal: 6.383 ± 0.783
0.709LeuTrp: 0.709 ± 0.199
1.87LeuTyr: 1.87 ± 0.348
0.0LeuXaa: 0.0 ± 0.0
Met
3.224MetAla: 3.224 ± 0.445
0.193MetCys: 0.193 ± 0.106
0.645MetAsp: 0.645 ± 0.201
1.032MetGlu: 1.032 ± 0.309
0.774MetPhe: 0.774 ± 0.237
1.289MetGly: 1.289 ± 0.27
0.451MetHis: 0.451 ± 0.156
0.838MetIle: 0.838 ± 0.25
0.903MetLys: 0.903 ± 0.25
1.483MetLeu: 1.483 ± 0.381
0.58MetMet: 0.58 ± 0.206
0.709MetAsn: 0.709 ± 0.187
1.032MetPro: 1.032 ± 0.246
0.58MetGln: 0.58 ± 0.174
1.354MetArg: 1.354 ± 0.327
2.643MetSer: 2.643 ± 0.482
1.805MetThr: 1.805 ± 0.338
1.161MetVal: 1.161 ± 0.281
0.258MetTrp: 0.258 ± 0.125
0.387MetTyr: 0.387 ± 0.204
0.0MetXaa: 0.0 ± 0.0
Asn
2.901AsnAla: 2.901 ± 0.665
0.645AsnCys: 0.645 ± 0.231
1.805AsnAsp: 1.805 ± 0.333
1.87AsnGlu: 1.87 ± 0.302
1.225AsnPhe: 1.225 ± 0.279
2.966AsnGly: 2.966 ± 0.594
0.516AsnHis: 0.516 ± 0.165
1.289AsnIle: 1.289 ± 0.221
0.709AsnLys: 0.709 ± 0.197
2.643AsnLeu: 2.643 ± 0.416
0.387AsnMet: 0.387 ± 0.134
1.096AsnAsn: 1.096 ± 0.255
1.805AsnPro: 1.805 ± 0.317
1.225AsnGln: 1.225 ± 0.236
1.741AsnArg: 1.741 ± 0.309
1.741AsnSer: 1.741 ± 0.334
1.418AsnThr: 1.418 ± 0.282
1.676AsnVal: 1.676 ± 0.404
0.451AsnTrp: 0.451 ± 0.158
0.709AsnTyr: 0.709 ± 0.216
0.0AsnXaa: 0.0 ± 0.0
Pro
5.351ProAla: 5.351 ± 0.677
0.516ProCys: 0.516 ± 0.18
3.159ProAsp: 3.159 ± 0.442
4.062ProGlu: 4.062 ± 0.461
1.096ProPhe: 1.096 ± 0.299
4.578ProGly: 4.578 ± 0.566
0.774ProHis: 0.774 ± 0.216
2.321ProIle: 2.321 ± 0.51
2.257ProLys: 2.257 ± 0.443
2.772ProLeu: 2.772 ± 0.483
1.225ProMet: 1.225 ± 0.304
2.063ProAsn: 2.063 ± 0.408
2.192ProPro: 2.192 ± 0.342
1.741ProGln: 1.741 ± 0.371
2.901ProArg: 2.901 ± 0.39
3.224ProSer: 3.224 ± 0.538
3.03ProThr: 3.03 ± 0.58
2.901ProVal: 2.901 ± 0.344
0.903ProTrp: 0.903 ± 0.243
1.547ProTyr: 1.547 ± 0.291
0.0ProXaa: 0.0 ± 0.0
Gln
4.384GlnAla: 4.384 ± 0.439
0.387GlnCys: 0.387 ± 0.168
1.612GlnAsp: 1.612 ± 0.374
2.772GlnGlu: 2.772 ± 0.497
1.289GlnPhe: 1.289 ± 0.367
2.321GlnGly: 2.321 ± 0.46
0.58GlnHis: 0.58 ± 0.191
1.612GlnIle: 1.612 ± 0.308
1.161GlnLys: 1.161 ± 0.328
3.804GlnLeu: 3.804 ± 0.552
1.032GlnMet: 1.032 ± 0.257
0.709GlnAsn: 0.709 ± 0.179
0.903GlnPro: 0.903 ± 0.243
0.645GlnGln: 0.645 ± 0.218
2.128GlnArg: 2.128 ± 0.535
1.741GlnSer: 1.741 ± 0.333
1.87GlnThr: 1.87 ± 0.358
2.772GlnVal: 2.772 ± 0.385
0.516GlnTrp: 0.516 ± 0.225
0.774GlnTyr: 0.774 ± 0.23
0.0GlnXaa: 0.0 ± 0.0
Arg
5.674ArgAla: 5.674 ± 0.622
0.774ArgCys: 0.774 ± 0.257
4.771ArgAsp: 4.771 ± 0.615
5.351ArgGlu: 5.351 ± 0.713
2.45ArgPhe: 2.45 ± 0.36
4.062ArgGly: 4.062 ± 0.453
1.547ArgHis: 1.547 ± 0.358
2.128ArgIle: 2.128 ± 0.401
4.449ArgLys: 4.449 ± 0.666
4.965ArgLeu: 4.965 ± 0.539
1.934ArgMet: 1.934 ± 0.419
1.676ArgAsn: 1.676 ± 0.302
3.095ArgPro: 3.095 ± 0.503
2.708ArgGln: 2.708 ± 0.497
5.48ArgArg: 5.48 ± 1.036
3.224ArgSer: 3.224 ± 0.554
4.707ArgThr: 4.707 ± 0.593
4.965ArgVal: 4.965 ± 0.651
1.225ArgTrp: 1.225 ± 0.336
1.547ArgTyr: 1.547 ± 0.293
0.0ArgXaa: 0.0 ± 0.0
Ser
7.608SerAla: 7.608 ± 0.747
0.064SerCys: 0.064 ± 0.062
3.288SerAsp: 3.288 ± 0.437
3.675SerGlu: 3.675 ± 0.482
1.676SerPhe: 1.676 ± 0.346
5.351SerGly: 5.351 ± 0.776
1.096SerHis: 1.096 ± 0.366
2.45SerIle: 2.45 ± 0.327
2.643SerLys: 2.643 ± 0.465
5.416SerLeu: 5.416 ± 0.638
1.161SerMet: 1.161 ± 0.328
1.87SerAsn: 1.87 ± 0.39
2.579SerPro: 2.579 ± 0.373
1.87SerGln: 1.87 ± 0.298
3.997SerArg: 3.997 ± 0.485
3.74SerSer: 3.74 ± 0.707
3.482SerThr: 3.482 ± 0.551
4.255SerVal: 4.255 ± 0.669
1.225SerTrp: 1.225 ± 0.25
2.386SerTyr: 2.386 ± 0.482
0.0SerXaa: 0.0 ± 0.0
Thr
5.867ThrAla: 5.867 ± 0.676
0.645ThrCys: 0.645 ± 0.231
4.191ThrAsp: 4.191 ± 0.556
4.578ThrGlu: 4.578 ± 0.515
2.192ThrPhe: 2.192 ± 0.388
5.158ThrGly: 5.158 ± 0.888
1.354ThrHis: 1.354 ± 0.347
3.224ThrIle: 3.224 ± 0.555
2.643ThrLys: 2.643 ± 0.395
5.093ThrLeu: 5.093 ± 0.646
1.161ThrMet: 1.161 ± 0.257
1.161ThrAsn: 1.161 ± 0.361
3.611ThrPro: 3.611 ± 0.68
1.805ThrGln: 1.805 ± 0.355
3.03ThrArg: 3.03 ± 0.532
4.707ThrSer: 4.707 ± 0.797
4.384ThrThr: 4.384 ± 0.595
5.287ThrVal: 5.287 ± 0.631
0.967ThrTrp: 0.967 ± 0.236
1.676ThrTyr: 1.676 ± 0.328
0.0ThrXaa: 0.0 ± 0.0
Val
7.866ValAla: 7.866 ± 0.727
0.58ValCys: 0.58 ± 0.177
3.933ValAsp: 3.933 ± 0.436
5.029ValGlu: 5.029 ± 0.647
1.676ValPhe: 1.676 ± 0.248
5.029ValGly: 5.029 ± 0.647
1.161ValHis: 1.161 ± 0.263
3.03ValIle: 3.03 ± 0.414
3.03ValLys: 3.03 ± 0.425
5.287ValLeu: 5.287 ± 0.758
1.87ValMet: 1.87 ± 0.418
2.45ValAsn: 2.45 ± 0.551
2.966ValPro: 2.966 ± 0.474
3.095ValGln: 3.095 ± 0.456
4.513ValArg: 4.513 ± 0.486
4.062ValSer: 4.062 ± 0.559
5.48ValThr: 5.48 ± 0.804
4.578ValVal: 4.578 ± 0.448
1.161ValTrp: 1.161 ± 0.231
1.87ValTyr: 1.87 ± 0.401
0.0ValXaa: 0.0 ± 0.0
Trp
1.934TrpAla: 1.934 ± 0.361
0.129TrpCys: 0.129 ± 0.079
1.418TrpAsp: 1.418 ± 0.295
1.612TrpGlu: 1.612 ± 0.285
0.451TrpPhe: 0.451 ± 0.143
1.547TrpGly: 1.547 ± 0.295
0.387TrpHis: 0.387 ± 0.169
0.516TrpIle: 0.516 ± 0.184
0.967TrpLys: 0.967 ± 0.227
1.547TrpLeu: 1.547 ± 0.417
0.516TrpMet: 0.516 ± 0.156
0.903TrpAsn: 0.903 ± 0.229
0.516TrpPro: 0.516 ± 0.186
0.322TrpGln: 0.322 ± 0.139
1.354TrpArg: 1.354 ± 0.224
1.096TrpSer: 1.096 ± 0.279
1.418TrpThr: 1.418 ± 0.308
1.032TrpVal: 1.032 ± 0.28
0.193TrpTrp: 0.193 ± 0.117
0.387TrpTyr: 0.387 ± 0.203
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.353TyrAla: 3.353 ± 0.408
0.129TyrCys: 0.129 ± 0.086
2.837TyrAsp: 2.837 ± 0.525
2.128TyrGlu: 2.128 ± 0.396
0.903TyrPhe: 0.903 ± 0.307
3.03TyrGly: 3.03 ± 0.556
0.451TyrHis: 0.451 ± 0.189
1.032TyrIle: 1.032 ± 0.332
0.903TyrLys: 0.903 ± 0.311
1.999TyrLeu: 1.999 ± 0.382
0.645TyrMet: 0.645 ± 0.19
0.709TyrAsn: 0.709 ± 0.16
1.096TyrPro: 1.096 ± 0.249
0.838TyrGln: 0.838 ± 0.216
2.386TyrArg: 2.386 ± 0.484
2.128TyrSer: 2.128 ± 0.377
1.289TyrThr: 1.289 ± 0.325
1.354TyrVal: 1.354 ± 0.278
0.58TyrTrp: 0.58 ± 0.18
0.516TyrTyr: 0.516 ± 0.182
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 78 proteins (15511 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski