Amino acid dipeptide frequency for Streptomyces phage Madamato

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
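A minimal sketch of such a calculation is given here. It assumes overlapping pairs are counted within each protein (never across protein boundaries) and that counts are normalized by the total number of pairs; the page does not state the exact procedure, so both choices, as well as the function name and the toy sequences, are illustrative assumptions. One-letter codes are used instead of the three-letter codes in the table.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for one-letter sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs inside one protein; pairs never span two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalize by the total number of counted pairs.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the proteome).
toy = ["MAAK", "AAL"]
freqs = dipeptide_frequencies(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

Because the order of residues matters, AlaCys and CysAla are counted as different dipeptides, which is why the table is not symmetric.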

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
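The uniform expectation and the over/under labelling can be sketched as follows. This assumes a plain comparison against 1/400 with no allowance for the error estimate; the page does not say which threshold the colouring actually uses, and the function name is hypothetical.

```python
N_STANDARD = 20  # standard amino acids, Xaa excluded

# Uniform expectation: 1 / (20 * 20) = 0.25 % = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / (N_STANDARD ** 2)


def classify(observed_permille, expected=EXPECTED_PERMILLE):
    """Label a dipeptide relative to the uniform expectation."""
    if observed_permille > expected:
        return "over"    # shown in red on the page
    if observed_permille < expected:
        return "under"   # shown in blue on the page
    return "expected"


# Two values taken from the table: AlaAla and CysCys.
print(EXPECTED_PERMILLE, classify(10.504), classify(0.174))
```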

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
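For example, the unit conversion can be written as a pair of small helpers (the function names are hypothetical):

```python
import math


def permille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10.0


# AlaAla from the table: 10.504 per mille, i.e. about 1.05 %.
fraction = permille_to_fraction(10.504)
percent = permille_to_percent(10.504)
print(math.isclose(fraction, 0.010504), math.isclose(percent, 1.0504))
```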

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰), grouped by first residue; each entry reads dipeptide: frequency ± error.
Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 10.504 ± 1.496
AlaCys: 0.174 ± 0.138
AlaAsp: 4.527 ± 0.671
AlaGlu: 7.312 ± 0.582
AlaPhe: 2.612 ± 0.345
AlaGly: 6.79 ± 1.381
AlaHis: 1.683 ± 0.33
AlaIle: 4.991 ± 0.714
AlaLys: 6.79 ± 0.804
AlaLeu: 8.067 ± 1.389
AlaMet: 2.96 ± 0.423
AlaAsn: 3.076 ± 0.4
AlaPro: 3.134 ± 0.438
AlaGln: 3.366 ± 0.536
AlaArg: 4.295 ± 0.56
AlaSer: 4.411 ± 0.469
AlaThr: 6.326 ± 0.609
AlaVal: 5.513 ± 0.613
AlaTrp: 0.871 ± 0.25
AlaTyr: 2.96 ± 0.427
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.348 ± 0.172
CysCys: 0.174 ± 0.144
CysAsp: 0.29 ± 0.14
CysGlu: 0.406 ± 0.209
CysPhe: 0.29 ± 0.134
CysGly: 0.522 ± 0.22
CysHis: 0.174 ± 0.096
CysIle: 0.348 ± 0.15
CysLys: 0.174 ± 0.084
CysLeu: 0.754 ± 0.259
CysMet: 0.116 ± 0.088
CysAsn: 0.116 ± 0.076
CysPro: 0.232 ± 0.129
CysGln: 0.232 ± 0.125
CysArg: 0.116 ± 0.083
CysSer: 0.29 ± 0.101
CysThr: 0.348 ± 0.15
CysVal: 0.174 ± 0.106
CysTrp: 0.116 ± 0.075
CysTyr: 0.058 ± 0.05
CysXaa: 0.0 ± 0.0

Asp
AspAla: 6.094 ± 0.645
AspCys: 0.406 ± 0.191
AspAsp: 3.018 ± 0.5
AspGlu: 5.745 ± 0.786
AspPhe: 2.496 ± 0.398
AspGly: 5.92 ± 0.671
AspHis: 1.045 ± 0.263
AspIle: 3.25 ± 0.61
AspLys: 3.656 ± 0.511
AspLeu: 3.946 ± 0.581
AspMet: 1.277 ± 0.273
AspAsn: 2.437 ± 0.392
AspPro: 3.772 ± 0.639
AspGln: 1.509 ± 0.327
AspArg: 2.728 ± 0.534
AspSer: 3.366 ± 0.364
AspThr: 3.366 ± 0.451
AspVal: 3.076 ± 0.434
AspTrp: 1.625 ± 0.347
AspTyr: 1.683 ± 0.28
AspXaa: 0.0 ± 0.0

Glu
GluAla: 5.165 ± 0.667
GluCys: 0.232 ± 0.147
GluAsp: 5.107 ± 0.722
GluGlu: 6.094 ± 0.872
GluPhe: 3.366 ± 0.484
GluGly: 5.165 ± 0.464
GluHis: 1.277 ± 0.315
GluIle: 3.714 ± 0.4
GluLys: 5.223 ± 0.56
GluLeu: 6.094 ± 0.599
GluMet: 2.147 ± 0.302
GluAsn: 3.192 ± 0.479
GluPro: 3.018 ± 0.47
GluGln: 3.134 ± 0.398
GluArg: 4.469 ± 0.672
GluSer: 4.12 ± 0.548
GluThr: 3.656 ± 0.446
GluVal: 5.397 ± 0.62
GluTrp: 0.987 ± 0.233
GluTyr: 2.379 ± 0.417
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.786 ± 0.409
PheCys: 0.232 ± 0.143
PheAsp: 3.192 ± 0.384
PheGlu: 2.728 ± 0.362
PhePhe: 1.393 ± 0.262
PheGly: 3.076 ± 0.469
PheHis: 0.522 ± 0.168
PheIle: 1.741 ± 0.368
PheLys: 2.379 ± 0.491
PheLeu: 2.67 ± 0.394
PheMet: 0.812 ± 0.164
PheAsn: 1.219 ± 0.291
PhePro: 1.103 ± 0.225
PheGln: 1.509 ± 0.325
PheArg: 2.728 ± 0.383
PheSer: 2.321 ± 0.52
PheThr: 1.973 ± 0.406
PheVal: 2.089 ± 0.341
PheTrp: 0.406 ± 0.153
PheTyr: 0.696 ± 0.338
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 6.906 ± 0.949
GlyCys: 0.464 ± 0.18
GlyAsp: 4.585 ± 0.491
GlyGlu: 4.585 ± 0.488
GlyPhe: 3.134 ± 0.488
GlyGly: 5.397 ± 0.496
GlyHis: 1.277 ± 0.259
GlyIle: 4.817 ± 0.761
GlyLys: 4.585 ± 0.571
GlyLeu: 6.21 ± 1.604
GlyMet: 2.496 ± 0.369
GlyAsn: 2.321 ± 0.433
GlyPro: 2.554 ± 0.388
GlyGln: 1.683 ± 0.349
GlyArg: 2.728 ± 0.321
GlySer: 5.049 ± 0.652
GlyThr: 6.094 ± 0.596
GlyVal: 6.616 ± 0.86
GlyTrp: 1.045 ± 0.223
GlyTyr: 3.54 ± 0.692
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.103 ± 0.323
HisCys: 0.29 ± 0.133
HisAsp: 1.219 ± 0.235
HisGlu: 1.277 ± 0.35
HisPhe: 0.348 ± 0.173
HisGly: 1.451 ± 0.355
HisHis: 0.58 ± 0.203
HisIle: 1.335 ± 0.337
HisLys: 0.987 ± 0.278
HisLeu: 0.987 ± 0.258
HisMet: 0.58 ± 0.172
HisAsn: 0.464 ± 0.175
HisPro: 0.871 ± 0.237
HisGln: 0.754 ± 0.215
HisArg: 1.045 ± 0.318
HisSer: 1.161 ± 0.228
HisThr: 0.812 ± 0.267
HisVal: 1.393 ± 0.364
HisTrp: 0.232 ± 0.124
HisTyr: 1.335 ± 0.238
HisXaa: 0.0 ± 0.0

Ile
IleAla: 4.933 ± 0.524
IleCys: 0.406 ± 0.177
IleAsp: 3.772 ± 0.353
IleGlu: 4.411 ± 0.56
IlePhe: 2.031 ± 0.35
IleGly: 3.54 ± 0.851
IleHis: 0.987 ± 0.299
IleIle: 2.089 ± 0.374
IleLys: 3.772 ± 0.437
IleLeu: 4.179 ± 0.393
IleMet: 1.045 ± 0.217
IleAsn: 2.205 ± 0.37
IlePro: 2.263 ± 0.357
IleGln: 1.915 ± 0.304
IleArg: 3.076 ± 0.44
IleSer: 3.424 ± 0.592
IleThr: 3.424 ± 0.467
IleVal: 4.411 ± 0.43
IleTrp: 0.522 ± 0.178
IleTyr: 1.335 ± 0.265
IleXaa: 0.0 ± 0.0

Lys
LysAla: 6.848 ± 0.842
LysCys: 0.116 ± 0.064
LysAsp: 3.308 ± 0.452
LysGlu: 3.83 ± 0.522
LysPhe: 1.915 ± 0.323
LysGly: 4.933 ± 1.154
LysHis: 1.219 ± 0.295
LysIle: 3.83 ± 0.468
LysLys: 6.384 ± 0.843
LysLeu: 6.848 ± 0.679
LysMet: 2.147 ± 0.464
LysAsn: 3.134 ± 0.435
LysPro: 2.321 ± 0.546
LysGln: 2.205 ± 0.39
LysArg: 2.902 ± 0.547
LysSer: 5.571 ± 0.486
LysThr: 4.469 ± 0.515
LysVal: 3.83 ± 0.353
LysTrp: 0.987 ± 0.261
LysTyr: 2.089 ± 0.245
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 8.415 ± 1.248
LeuCys: 0.464 ± 0.173
LeuAsp: 4.295 ± 0.54
LeuGlu: 6.558 ± 0.619
LeuPhe: 2.379 ± 0.404
LeuGly: 6.21 ± 0.992
LeuHis: 1.625 ± 0.334
LeuIle: 3.656 ± 0.672
LeuLys: 6.152 ± 0.705
LeuLeu: 5.978 ± 0.719
LeuMet: 1.799 ± 0.318
LeuAsn: 3.308 ± 0.424
LeuPro: 2.205 ± 0.364
LeuGln: 2.786 ± 0.354
LeuArg: 4.12 ± 0.528
LeuSer: 4.701 ± 0.568
LeuThr: 6.268 ± 0.601
LeuVal: 5.165 ± 0.636
LeuTrp: 1.045 ± 0.249
LeuTyr: 2.67 ± 0.403
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.625 ± 0.33
MetCys: 0.116 ± 0.08
MetAsp: 1.973 ± 0.359
MetGlu: 1.393 ± 0.273
MetPhe: 1.335 ± 0.261
MetGly: 1.451 ± 0.326
MetHis: 0.174 ± 0.081
MetIle: 1.277 ± 0.34
MetLys: 1.799 ± 0.246
MetLeu: 2.612 ± 0.457
MetMet: 0.522 ± 0.163
MetAsn: 1.625 ± 0.304
MetPro: 1.277 ± 0.273
MetGln: 0.58 ± 0.182
MetArg: 1.567 ± 0.305
MetSer: 2.67 ± 0.434
MetThr: 2.147 ± 0.389
MetVal: 1.335 ± 0.244
MetTrp: 0.232 ± 0.106
MetTyr: 0.696 ± 0.184
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.366 ± 0.4
AsnCys: 0.116 ± 0.105
AsnAsp: 2.321 ± 0.368
AsnGlu: 3.308 ± 0.55
AsnPhe: 1.335 ± 0.401
AsnGly: 3.366 ± 0.415
AsnHis: 0.987 ± 0.228
AsnIle: 1.799 ± 0.314
AsnLys: 2.147 ± 0.397
AsnLeu: 2.612 ± 0.415
AsnMet: 0.754 ± 0.189
AsnAsn: 1.857 ± 0.322
AsnPro: 2.263 ± 0.413
AsnGln: 1.857 ± 0.307
AsnArg: 2.496 ± 0.455
AsnSer: 2.554 ± 0.337
AsnThr: 2.554 ± 0.378
AsnVal: 2.554 ± 0.386
AsnTrp: 0.174 ± 0.117
AsnTyr: 1.335 ± 0.342
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 2.67 ± 0.448
ProCys: 0.348 ± 0.146
ProAsp: 3.83 ± 0.707
ProGlu: 3.83 ± 0.675
ProPhe: 1.451 ± 0.303
ProGly: 2.844 ± 0.426
ProHis: 0.987 ± 0.242
ProIle: 1.683 ± 0.332
ProLys: 2.786 ± 0.36
ProLeu: 2.379 ± 0.392
ProMet: 1.103 ± 0.259
ProAsn: 1.393 ± 0.311
ProPro: 1.857 ± 0.395
ProGln: 1.045 ± 0.234
ProArg: 1.741 ± 0.372
ProSer: 3.018 ± 0.479
ProThr: 3.192 ± 0.53
ProVal: 2.844 ± 0.353
ProTrp: 0.29 ± 0.145
ProTyr: 1.219 ± 0.316
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 2.728 ± 0.468
GlnCys: 0.116 ± 0.082
GlnAsp: 1.683 ± 0.34
GlnGlu: 2.263 ± 0.43
GlnPhe: 1.219 ± 0.272
GlnGly: 2.612 ± 0.415
GlnHis: 0.58 ± 0.194
GlnIle: 2.437 ± 0.284
GlnLys: 2.437 ± 0.332
GlnLeu: 3.134 ± 0.508
GlnMet: 0.929 ± 0.297
GlnAsn: 1.045 ± 0.248
GlnPro: 1.335 ± 0.358
GlnGln: 1.509 ± 0.366
GlnArg: 1.625 ± 0.237
GlnSer: 1.973 ± 0.344
GlnThr: 2.031 ± 0.358
GlnVal: 2.147 ± 0.35
GlnTrp: 0.174 ± 0.123
GlnTyr: 1.103 ± 0.209
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 4.004 ± 0.58
ArgCys: 0.232 ± 0.137
ArgAsp: 2.728 ± 0.462
ArgGlu: 4.004 ± 0.55
ArgPhe: 1.277 ± 0.256
ArgGly: 2.902 ± 0.511
ArgHis: 1.045 ± 0.331
ArgIle: 2.554 ± 0.45
ArgLys: 4.411 ± 0.475
ArgLeu: 4.527 ± 0.733
ArgMet: 1.393 ± 0.282
ArgAsn: 2.96 ± 0.364
ArgPro: 2.089 ± 0.384
ArgGln: 2.031 ± 0.414
ArgArg: 2.96 ± 0.627
ArgSer: 3.134 ± 0.447
ArgThr: 2.728 ± 0.382
ArgVal: 2.96 ± 0.456
ArgTrp: 0.696 ± 0.252
ArgTyr: 1.567 ± 0.436
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 5.803 ± 0.811
SerCys: 0.29 ± 0.164
SerAsp: 3.25 ± 0.598
SerGlu: 4.237 ± 0.517
SerPhe: 2.205 ± 0.339
SerGly: 5.629 ± 0.541
SerHis: 1.103 ± 0.316
SerIle: 3.424 ± 0.479
SerLys: 4.12 ± 0.585
SerLeu: 4.875 ± 0.462
SerMet: 1.799 ± 0.301
SerAsn: 2.437 ± 0.258
SerPro: 2.321 ± 0.348
SerGln: 2.089 ± 0.345
SerArg: 3.018 ± 0.414
SerSer: 4.701 ± 0.884
SerThr: 4.179 ± 0.534
SerVal: 4.759 ± 0.536
SerTrp: 1.509 ± 0.256
SerTyr: 2.031 ± 0.35
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 6.558 ± 0.735
ThrCys: 0.348 ± 0.155
ThrAsp: 4.179 ± 0.454
ThrGlu: 3.83 ± 0.464
ThrPhe: 2.67 ± 0.365
ThrGly: 5.803 ± 0.723
ThrHis: 1.103 ± 0.26
ThrIle: 4.353 ± 0.592
ThrLys: 3.714 ± 0.432
ThrLeu: 4.933 ± 0.531
ThrMet: 1.625 ± 0.269
ThrAsn: 1.335 ± 0.216
ThrPro: 3.83 ± 0.58
ThrGln: 1.161 ± 0.229
ThrArg: 2.96 ± 0.454
ThrSer: 3.192 ± 0.447
ThrThr: 5.92 ± 0.774
ThrVal: 6.094 ± 0.544
ThrTrp: 1.045 ± 0.292
ThrTyr: 2.437 ± 0.454
ThrXaa: 0.0 ± 0.0

Val
ValAla: 6.036 ± 0.495
ValCys: 0.29 ± 0.121
ValAsp: 3.946 ± 0.474
ValGlu: 5.107 ± 0.586
ValPhe: 2.147 ± 0.389
ValGly: 4.991 ± 0.606
ValHis: 0.754 ± 0.217
ValIle: 3.888 ± 0.495
ValLys: 4.585 ± 0.473
ValLeu: 6.094 ± 0.678
ValMet: 2.031 ± 0.327
ValAsn: 2.67 ± 0.393
ValPro: 2.496 ± 0.374
ValGln: 1.973 ± 0.321
ValArg: 3.366 ± 0.452
ValSer: 4.469 ± 0.467
ValThr: 5.339 ± 0.705
ValVal: 4.817 ± 0.75
ValTrp: 1.045 ± 0.248
ValTyr: 2.96 ± 0.497
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.277 ± 0.294
TrpCys: 0.29 ± 0.111
TrpAsp: 0.929 ± 0.261
TrpGlu: 1.277 ± 0.276
TrpPhe: 0.406 ± 0.139
TrpGly: 1.451 ± 0.293
TrpHis: 0.232 ± 0.127
TrpIle: 0.638 ± 0.221
TrpLys: 0.871 ± 0.253
TrpLeu: 0.754 ± 0.205
TrpMet: 0.406 ± 0.201
TrpAsn: 0.987 ± 0.318
TrpPro: 0.174 ± 0.091
TrpGln: 0.232 ± 0.112
TrpArg: 0.406 ± 0.18
TrpSer: 1.045 ± 0.334
TrpThr: 0.754 ± 0.197
TrpVal: 1.045 ± 0.23
TrpTrp: 0.174 ± 0.101
TrpTyr: 0.348 ± 0.176
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 3.192 ± 0.49
TyrCys: 0.174 ± 0.082
TyrAsp: 2.263 ± 0.454
TyrGlu: 1.799 ± 0.308
TyrPhe: 1.567 ± 0.415
TyrGly: 2.031 ± 0.372
TyrHis: 0.871 ± 0.234
TyrIle: 1.915 ± 0.359
TyrLys: 1.915 ± 0.394
TyrLeu: 2.147 ± 0.342
TyrMet: 0.406 ± 0.199
TyrAsn: 1.973 ± 0.344
TyrPro: 1.393 ± 0.316
TyrGln: 1.509 ± 0.296
TyrArg: 1.857 ± 0.323
TyrSer: 2.612 ± 0.402
TyrThr: 1.509 ± 0.38
TyrVal: 2.786 ± 0.339
TyrTrp: 0.522 ± 0.194
TyrTyr: 1.741 ± 0.44
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 85 proteins (17232 amino acids)
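As a rough consistency check, a per-mille value can be turned back into an approximate number of occurrences. The sketch below assumes the frequencies were normalized by the total number of amino acids (17232); that denominator is an inference from the size of the smallest non-zero values, not something the page states, and the function name is hypothetical.

```python
TOTAL_AMINO_ACIDS = 17232  # from the statistics line above


def approx_count(permille, denominator=TOTAL_AMINO_ACIDS):
    """Estimate how many occurrences a per-mille frequency corresponds to."""
    # Assumption: frequency = count / denominator * 1000.
    return round(permille / 1000.0 * denominator)


# AlaAla (most frequent) and CysTyr (rarest non-zero) from the table.
alaala = approx_count(10.504)
cystyr = approx_count(0.058)
print(alaala, cystyr)
```

With so few occurrences behind the rarest dipeptides, their relative errors are necessarily large.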

Note: The error was estimated by bootstrapping (×100) at the protein level.
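One way to read this note is sketched below: whole proteins are resampled with replacement 100 times, the frequency is recomputed for each resample, and the spread of those estimates is reported as the error. This is an illustration under stated assumptions, not the database's actual code; the use of the standard deviation, the normalization by pair count, the fixed seed, and all names are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides in one one-letter sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Return (mean, std dev) of the per-mille frequency of `pair`."""
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the set size.
        sample = [rng.choice(per_protein) for _ in per_protein]
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy proteome (hypothetical sequences).
mean, err = bootstrap_error(["MAAK", "AAL", "GAAG", "MKLV"], "AA")
print(round(mean, 3), round(err, 3))
```

Resampling proteins rather than individual residues keeps each sequence intact, so a dipeptide concentrated in a few proteins yields a larger error than one spread evenly across the proteome.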

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski