Amino acid dipepetide frequency for Streptomyces phage Salete

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
22.356AlaAla: 22.356 ± 1.629
1.207AlaCys: 1.207 ± 0.305
8.046AlaAsp: 8.046 ± 0.883
10.977AlaGlu: 10.977 ± 0.761
3.966AlaPhe: 3.966 ± 0.46
11.667AlaGly: 11.667 ± 0.883
2.586AlaHis: 2.586 ± 0.322
3.966AlaIle: 3.966 ± 0.663
3.563AlaLys: 3.563 ± 0.559
10.575AlaLeu: 10.575 ± 1.092
3.563AlaMet: 3.563 ± 0.389
3.506AlaAsn: 3.506 ± 0.516
6.322AlaPro: 6.322 ± 0.898
4.195AlaGln: 4.195 ± 0.422
8.448AlaArg: 8.448 ± 0.769
7.759AlaSer: 7.759 ± 0.798
9.828AlaThr: 9.828 ± 0.794
10.345AlaVal: 10.345 ± 0.88
1.437AlaTrp: 1.437 ± 0.281
2.816AlaTyr: 2.816 ± 0.362
0.0AlaXaa: 0.0 ± 0.0
Cys
2.241CysAla: 2.241 ± 0.357
0.517CysCys: 0.517 ± 0.213
0.92CysAsp: 0.92 ± 0.281
0.287CysGlu: 0.287 ± 0.123
0.345CysPhe: 0.345 ± 0.132
2.241CysGly: 2.241 ± 0.514
0.517CysHis: 0.517 ± 0.205
0.46CysIle: 0.46 ± 0.172
0.287CysLys: 0.287 ± 0.12
1.092CysLeu: 1.092 ± 0.266
0.345CysMet: 0.345 ± 0.153
0.402CysAsn: 0.402 ± 0.144
1.264CysPro: 1.264 ± 0.322
0.69CysGln: 0.69 ± 0.196
0.92CysArg: 0.92 ± 0.233
0.575CysSer: 0.575 ± 0.216
1.034CysThr: 1.034 ± 0.27
1.322CysVal: 1.322 ± 0.27
0.345CysTrp: 0.345 ± 0.166
0.46CysTyr: 0.46 ± 0.166
0.0CysXaa: 0.0 ± 0.0
Asp
9.828AspAla: 9.828 ± 0.662
1.322AspCys: 1.322 ± 0.346
4.598AspAsp: 4.598 ± 0.884
3.966AspGlu: 3.966 ± 0.526
1.839AspPhe: 1.839 ± 0.292
5.575AspGly: 5.575 ± 0.639
1.264AspHis: 1.264 ± 0.285
2.126AspIle: 2.126 ± 0.404
0.977AspLys: 0.977 ± 0.257
5.747AspLeu: 5.747 ± 0.666
1.092AspMet: 1.092 ± 0.305
1.207AspAsn: 1.207 ± 0.191
3.793AspPro: 3.793 ± 0.55
2.241AspGln: 2.241 ± 0.35
5.057AspArg: 5.057 ± 0.644
3.046AspSer: 3.046 ± 0.377
3.218AspThr: 3.218 ± 0.402
5.862AspVal: 5.862 ± 0.556
0.92AspTrp: 0.92 ± 0.317
1.034AspTyr: 1.034 ± 0.237
0.0AspXaa: 0.0 ± 0.0
Glu
7.931GluAla: 7.931 ± 0.688
1.437GluCys: 1.437 ± 0.276
3.736GluAsp: 3.736 ± 0.515
1.264GluGlu: 1.264 ± 0.292
1.379GluPhe: 1.379 ± 0.249
3.908GluGly: 3.908 ± 0.456
0.977GluHis: 0.977 ± 0.221
1.149GluIle: 1.149 ± 0.327
1.034GluLys: 1.034 ± 0.219
5.862GluLeu: 5.862 ± 0.654
1.034GluMet: 1.034 ± 0.227
0.632GluAsn: 0.632 ± 0.22
3.218GluPro: 3.218 ± 0.492
1.724GluGln: 1.724 ± 0.336
4.425GluArg: 4.425 ± 0.584
2.816GluSer: 2.816 ± 0.335
3.563GluThr: 3.563 ± 0.448
3.851GluVal: 3.851 ± 0.489
1.034GluTrp: 1.034 ± 0.204
0.977GluTyr: 0.977 ± 0.234
0.0GluXaa: 0.0 ± 0.0
Phe
3.448PheAla: 3.448 ± 0.36
0.172PheCys: 0.172 ± 0.093
2.011PheAsp: 2.011 ± 0.441
1.954PheGlu: 1.954 ± 0.31
0.747PhePhe: 0.747 ± 0.26
2.989PheGly: 2.989 ± 0.403
0.46PheHis: 0.46 ± 0.142
0.575PheIle: 0.575 ± 0.174
0.23PheLys: 0.23 ± 0.129
2.759PheLeu: 2.759 ± 0.431
0.517PheMet: 0.517 ± 0.146
1.264PheAsn: 1.264 ± 0.306
1.322PhePro: 1.322 ± 0.266
1.149PheGln: 1.149 ± 0.27
1.552PheArg: 1.552 ± 0.272
1.207PheSer: 1.207 ± 0.319
2.126PheThr: 2.126 ± 0.351
2.011PheVal: 2.011 ± 0.313
0.172PheTrp: 0.172 ± 0.102
0.402PheTyr: 0.402 ± 0.185
0.0PheXaa: 0.0 ± 0.0
Gly
9.77GlyAla: 9.77 ± 1.412
2.184GlyCys: 2.184 ± 0.388
5.575GlyAsp: 5.575 ± 0.759
4.195GlyGlu: 4.195 ± 0.459
2.874GlyPhe: 2.874 ± 0.4
9.195GlyGly: 9.195 ± 2.117
1.954GlyHis: 1.954 ± 0.346
3.391GlyIle: 3.391 ± 0.41
3.103GlyLys: 3.103 ± 0.588
7.069GlyLeu: 7.069 ± 0.732
1.724GlyMet: 1.724 ± 0.38
2.471GlyAsn: 2.471 ± 0.351
4.713GlyPro: 4.713 ± 0.583
3.966GlyGln: 3.966 ± 0.466
6.667GlyArg: 6.667 ± 0.585
4.77GlySer: 4.77 ± 0.615
6.264GlyThr: 6.264 ± 0.57
6.264GlyVal: 6.264 ± 0.545
2.184GlyTrp: 2.184 ± 0.394
1.839GlyTyr: 1.839 ± 0.306
0.0GlyXaa: 0.0 ± 0.0
His
2.414HisAla: 2.414 ± 0.378
0.402HisCys: 0.402 ± 0.161
0.977HisAsp: 0.977 ± 0.194
0.805HisGlu: 0.805 ± 0.199
0.46HisPhe: 0.46 ± 0.147
1.437HisGly: 1.437 ± 0.346
0.575HisHis: 0.575 ± 0.143
0.575HisIle: 0.575 ± 0.175
0.287HisLys: 0.287 ± 0.141
1.149HisLeu: 1.149 ± 0.236
0.575HisMet: 0.575 ± 0.174
0.805HisAsn: 0.805 ± 0.232
0.862HisPro: 0.862 ± 0.284
0.805HisGln: 0.805 ± 0.182
1.954HisArg: 1.954 ± 0.308
0.69HisSer: 0.69 ± 0.201
1.322HisThr: 1.322 ± 0.254
2.126HisVal: 2.126 ± 0.356
0.862HisTrp: 0.862 ± 0.235
0.345HisTyr: 0.345 ± 0.147
0.0HisXaa: 0.0 ± 0.0
Ile
4.713IleAla: 4.713 ± 0.583
0.517IleCys: 0.517 ± 0.17
1.552IleAsp: 1.552 ± 0.315
3.046IleGlu: 3.046 ± 0.334
0.69IlePhe: 0.69 ± 0.142
2.931IleGly: 2.931 ± 0.4
0.747IleHis: 0.747 ± 0.187
1.494IleIle: 1.494 ± 0.351
1.092IleLys: 1.092 ± 0.279
2.414IleLeu: 2.414 ± 0.6
0.69IleMet: 0.69 ± 0.185
1.322IleAsn: 1.322 ± 0.414
2.471IlePro: 2.471 ± 0.405
1.322IleGln: 1.322 ± 0.233
2.241IleArg: 2.241 ± 0.352
2.471IleSer: 2.471 ± 0.357
2.414IleThr: 2.414 ± 0.492
2.126IleVal: 2.126 ± 0.368
0.345IleTrp: 0.345 ± 0.113
0.575IleTyr: 0.575 ± 0.181
0.0IleXaa: 0.0 ± 0.0
Lys
4.08LysAla: 4.08 ± 0.623
0.23LysCys: 0.23 ± 0.106
1.897LysAsp: 1.897 ± 0.331
0.862LysGlu: 0.862 ± 0.245
0.575LysPhe: 0.575 ± 0.146
2.701LysGly: 2.701 ± 0.373
0.92LysHis: 0.92 ± 0.264
1.034LysIle: 1.034 ± 0.227
1.322LysLys: 1.322 ± 0.307
2.931LysLeu: 2.931 ± 0.375
0.977LysMet: 0.977 ± 0.292
0.92LysAsn: 0.92 ± 0.269
2.011LysPro: 2.011 ± 0.386
1.264LysGln: 1.264 ± 0.246
1.724LysArg: 1.724 ± 0.368
1.782LysSer: 1.782 ± 0.364
1.954LysThr: 1.954 ± 0.393
1.897LysVal: 1.897 ± 0.29
0.23LysTrp: 0.23 ± 0.098
0.575LysTyr: 0.575 ± 0.158
0.0LysXaa: 0.0 ± 0.0
Leu
10.862LeuAla: 10.862 ± 1.044
1.092LeuCys: 1.092 ± 0.317
6.207LeuAsp: 6.207 ± 0.559
3.793LeuGlu: 3.793 ± 0.377
2.241LeuPhe: 2.241 ± 0.35
7.529LeuGly: 7.529 ± 0.944
0.977LeuHis: 0.977 ± 0.253
3.103LeuIle: 3.103 ± 0.594
3.563LeuLys: 3.563 ± 0.399
6.494LeuLeu: 6.494 ± 0.844
1.839LeuMet: 1.839 ± 0.328
2.471LeuAsn: 2.471 ± 0.305
4.598LeuPro: 4.598 ± 0.422
1.322LeuGln: 1.322 ± 0.295
5.977LeuArg: 5.977 ± 0.748
4.77LeuSer: 4.77 ± 0.504
8.046LeuThr: 8.046 ± 0.791
6.667LeuVal: 6.667 ± 0.534
0.69LeuTrp: 0.69 ± 0.232
1.494LeuTyr: 1.494 ± 0.23
0.0LeuXaa: 0.0 ± 0.0
Met
4.08MetAla: 4.08 ± 0.522
0.115MetCys: 0.115 ± 0.069
1.437MetAsp: 1.437 ± 0.296
0.632MetGlu: 0.632 ± 0.209
0.402MetPhe: 0.402 ± 0.171
1.724MetGly: 1.724 ± 0.305
0.172MetHis: 0.172 ± 0.091
1.207MetIle: 1.207 ± 0.277
1.322MetLys: 1.322 ± 0.289
1.667MetLeu: 1.667 ± 0.265
0.402MetMet: 0.402 ± 0.13
0.747MetAsn: 0.747 ± 0.221
0.977MetPro: 0.977 ± 0.214
0.69MetGln: 0.69 ± 0.186
1.609MetArg: 1.609 ± 0.326
1.494MetSer: 1.494 ± 0.28
1.552MetThr: 1.552 ± 0.287
1.322MetVal: 1.322 ± 0.261
0.402MetTrp: 0.402 ± 0.157
0.575MetTyr: 0.575 ± 0.158
0.0MetXaa: 0.0 ± 0.0
Asn
3.448AsnAla: 3.448 ± 0.427
0.46AsnCys: 0.46 ± 0.15
1.609AsnAsp: 1.609 ± 0.272
1.034AsnGlu: 1.034 ± 0.235
0.862AsnPhe: 0.862 ± 0.174
3.333AsnGly: 3.333 ± 0.408
0.345AsnHis: 0.345 ± 0.159
1.264AsnIle: 1.264 ± 0.271
0.402AsnLys: 0.402 ± 0.156
2.529AsnLeu: 2.529 ± 0.594
0.632AsnMet: 0.632 ± 0.204
0.46AsnAsn: 0.46 ± 0.145
1.782AsnPro: 1.782 ± 0.295
1.379AsnGln: 1.379 ± 0.243
2.069AsnArg: 2.069 ± 0.325
1.149AsnSer: 1.149 ± 0.243
1.379AsnThr: 1.379 ± 0.237
2.069AsnVal: 2.069 ± 0.342
0.23AsnTrp: 0.23 ± 0.095
0.517AsnTyr: 0.517 ± 0.145
0.0AsnXaa: 0.0 ± 0.0
Pro
7.874ProAla: 7.874 ± 0.831
1.092ProCys: 1.092 ± 0.296
4.023ProAsp: 4.023 ± 0.501
4.425ProGlu: 4.425 ± 0.562
1.092ProPhe: 1.092 ± 0.234
5.402ProGly: 5.402 ± 0.508
1.034ProHis: 1.034 ± 0.266
1.609ProIle: 1.609 ± 0.393
2.126ProLys: 2.126 ± 0.494
3.966ProLeu: 3.966 ± 0.654
0.92ProMet: 0.92 ± 0.289
1.264ProAsn: 1.264 ± 0.247
3.161ProPro: 3.161 ± 0.576
1.609ProGln: 1.609 ± 0.349
3.103ProArg: 3.103 ± 0.521
3.678ProSer: 3.678 ± 0.484
2.586ProThr: 2.586 ± 0.375
5.747ProVal: 5.747 ± 0.422
1.149ProTrp: 1.149 ± 0.238
1.034ProTyr: 1.034 ± 0.278
0.0ProXaa: 0.0 ± 0.0
Gln
4.54GlnAla: 4.54 ± 0.499
0.69GlnCys: 0.69 ± 0.256
1.782GlnAsp: 1.782 ± 0.263
0.632GlnGlu: 0.632 ± 0.193
1.149GlnPhe: 1.149 ± 0.236
2.184GlnGly: 2.184 ± 0.296
0.402GlnHis: 0.402 ± 0.167
1.034GlnIle: 1.034 ± 0.233
0.517GlnLys: 0.517 ± 0.196
4.253GlnLeu: 4.253 ± 0.517
0.977GlnMet: 0.977 ± 0.212
0.805GlnAsn: 0.805 ± 0.217
2.241GlnPro: 2.241 ± 0.383
1.437GlnGln: 1.437 ± 0.247
2.356GlnArg: 2.356 ± 0.444
0.977GlnSer: 0.977 ± 0.239
2.011GlnThr: 2.011 ± 0.397
2.414GlnVal: 2.414 ± 0.329
1.034GlnTrp: 1.034 ± 0.231
0.862GlnTyr: 0.862 ± 0.205
0.0GlnXaa: 0.0 ± 0.0
Arg
8.103ArgAla: 8.103 ± 0.818
1.149ArgCys: 1.149 ± 0.287
4.54ArgAsp: 4.54 ± 0.645
2.989ArgGlu: 2.989 ± 0.441
1.552ArgPhe: 1.552 ± 0.308
4.77ArgGly: 4.77 ± 0.532
1.667ArgHis: 1.667 ± 0.318
3.046ArgIle: 3.046 ± 0.41
2.989ArgLys: 2.989 ± 0.544
5.747ArgLeu: 5.747 ± 0.625
2.471ArgMet: 2.471 ± 0.348
2.069ArgAsn: 2.069 ± 0.322
2.989ArgPro: 2.989 ± 0.436
2.241ArgGln: 2.241 ± 0.299
5.69ArgArg: 5.69 ± 0.654
2.701ArgSer: 2.701 ± 0.331
5.23ArgThr: 5.23 ± 0.554
5.862ArgVal: 5.862 ± 0.565
1.264ArgTrp: 1.264 ± 0.25
1.839ArgTyr: 1.839 ± 0.401
0.0ArgXaa: 0.0 ± 0.0
Ser
6.322SerAla: 6.322 ± 0.632
1.034SerCys: 1.034 ± 0.331
2.701SerAsp: 2.701 ± 0.46
2.759SerGlu: 2.759 ± 0.42
2.011SerPhe: 2.011 ± 0.335
6.609SerGly: 6.609 ± 0.648
1.092SerHis: 1.092 ± 0.226
1.609SerIle: 1.609 ± 0.304
1.322SerLys: 1.322 ± 0.271
4.138SerLeu: 4.138 ± 0.469
1.552SerMet: 1.552 ± 0.265
0.977SerAsn: 0.977 ± 0.208
3.333SerPro: 3.333 ± 0.374
1.092SerGln: 1.092 ± 0.237
3.276SerArg: 3.276 ± 0.504
2.931SerSer: 2.931 ± 0.594
3.621SerThr: 3.621 ± 0.433
4.195SerVal: 4.195 ± 0.497
0.862SerTrp: 0.862 ± 0.225
1.264SerTyr: 1.264 ± 0.311
0.0SerXaa: 0.0 ± 0.0
Thr
9.425ThrAla: 9.425 ± 0.87
0.977ThrCys: 0.977 ± 0.34
3.966ThrAsp: 3.966 ± 0.541
4.54ThrGlu: 4.54 ± 0.484
1.667ThrPhe: 1.667 ± 0.24
8.103ThrGly: 8.103 ± 0.662
1.207ThrHis: 1.207 ± 0.276
2.701ThrIle: 2.701 ± 0.395
1.954ThrLys: 1.954 ± 0.227
5.632ThrLeu: 5.632 ± 0.51
1.437ThrMet: 1.437 ± 0.332
1.552ThrAsn: 1.552 ± 0.28
3.966ThrPro: 3.966 ± 0.459
1.782ThrGln: 1.782 ± 0.266
3.506ThrArg: 3.506 ± 0.419
3.276ThrSer: 3.276 ± 0.462
3.966ThrThr: 3.966 ± 0.577
6.322ThrVal: 6.322 ± 0.47
1.092ThrTrp: 1.092 ± 0.244
1.552ThrTyr: 1.552 ± 0.262
0.0ThrXaa: 0.0 ± 0.0
Val
11.782ValAla: 11.782 ± 1.065
0.92ValCys: 0.92 ± 0.254
6.724ValAsp: 6.724 ± 0.506
2.299ValGlu: 2.299 ± 0.359
2.299ValPhe: 2.299 ± 0.349
4.368ValGly: 4.368 ± 0.547
1.724ValHis: 1.724 ± 0.279
3.736ValIle: 3.736 ± 0.442
2.529ValLys: 2.529 ± 0.362
6.034ValLeu: 6.034 ± 0.786
1.264ValMet: 1.264 ± 0.269
2.529ValAsn: 2.529 ± 0.307
5.805ValPro: 5.805 ± 0.701
2.241ValGln: 2.241 ± 0.382
5.69ValArg: 5.69 ± 0.564
4.54ValSer: 4.54 ± 0.575
6.552ValThr: 6.552 ± 0.715
6.092ValVal: 6.092 ± 0.621
1.494ValTrp: 1.494 ± 0.313
0.862ValTyr: 0.862 ± 0.258
0.0ValXaa: 0.0 ± 0.0
Trp
1.839TrpAla: 1.839 ± 0.287
0.345TrpCys: 0.345 ± 0.122
0.977TrpAsp: 0.977 ± 0.229
0.23TrpGlu: 0.23 ± 0.11
0.575TrpPhe: 0.575 ± 0.208
0.92TrpGly: 0.92 ± 0.263
0.402TrpHis: 0.402 ± 0.265
0.345TrpIle: 0.345 ± 0.141
0.747TrpLys: 0.747 ± 0.195
1.782TrpLeu: 1.782 ± 0.341
0.172TrpMet: 0.172 ± 0.086
0.862TrpAsn: 0.862 ± 0.226
1.092TrpPro: 1.092 ± 0.264
0.46TrpGln: 0.46 ± 0.147
0.805TrpArg: 0.805 ± 0.238
1.437TrpSer: 1.437 ± 0.248
0.977TrpThr: 0.977 ± 0.242
1.379TrpVal: 1.379 ± 0.302
0.517TrpTrp: 0.517 ± 0.166
0.517TrpTyr: 0.517 ± 0.19
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.069TyrAla: 2.069 ± 0.339
0.402TyrCys: 0.402 ± 0.165
1.437TyrAsp: 1.437 ± 0.304
1.264TyrGlu: 1.264 ± 0.288
0.345TyrPhe: 0.345 ± 0.128
2.586TyrGly: 2.586 ± 0.452
0.402TyrHis: 0.402 ± 0.126
0.747TyrIle: 0.747 ± 0.19
0.517TyrLys: 0.517 ± 0.149
1.897TyrLeu: 1.897 ± 0.338
0.287TyrMet: 0.287 ± 0.109
0.69TyrAsn: 0.69 ± 0.187
0.977TyrPro: 0.977 ± 0.224
0.69TyrGln: 0.69 ± 0.181
1.839TyrArg: 1.839 ± 0.416
0.69TyrSer: 0.69 ± 0.181
0.977TyrThr: 0.977 ± 0.239
1.609TyrVal: 1.609 ± 0.329
0.115TyrTrp: 0.115 ± 0.078
0.172TyrTyr: 0.172 ± 0.095
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 71 proteins (17401 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski