Amino acid dipepetide frequency for Streptomyces phage BRock

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.877AlaAla: 7.877 ± 0.794
0.769AlaCys: 0.769 ± 0.167
4.369AlaAsp: 4.369 ± 0.396
5.261AlaGlu: 5.261 ± 0.486
2.646AlaPhe: 2.646 ± 0.265
6.707AlaGly: 6.707 ± 0.743
1.477AlaHis: 1.477 ± 0.25
4.246AlaIle: 4.246 ± 0.353
4.338AlaLys: 4.338 ± 0.472
6.769AlaLeu: 6.769 ± 0.501
2.461AlaMet: 2.461 ± 0.297
3.385AlaAsn: 3.385 ± 0.392
4.061AlaPro: 4.061 ± 0.326
3.661AlaGln: 3.661 ± 0.434
4.461AlaArg: 4.461 ± 0.426
4.954AlaSer: 4.954 ± 0.401
5.938AlaThr: 5.938 ± 0.582
6.184AlaVal: 6.184 ± 0.494
1.292AlaTrp: 1.292 ± 0.155
2.954AlaTyr: 2.954 ± 0.345
0.0AlaXaa: 0.0 ± 0.0
Cys
0.677CysAla: 0.677 ± 0.171
0.123CysCys: 0.123 ± 0.056
0.8CysAsp: 0.8 ± 0.18
0.646CysGlu: 0.646 ± 0.146
0.369CysPhe: 0.369 ± 0.106
1.108CysGly: 1.108 ± 0.232
0.154CysHis: 0.154 ± 0.067
0.369CysIle: 0.369 ± 0.1
0.431CysLys: 0.431 ± 0.139
0.462CysLeu: 0.462 ± 0.119
0.369CysMet: 0.369 ± 0.124
0.462CysAsn: 0.462 ± 0.117
1.046CysPro: 1.046 ± 0.229
0.4CysGln: 0.4 ± 0.137
0.338CysArg: 0.338 ± 0.108
0.431CysSer: 0.431 ± 0.116
0.585CysThr: 0.585 ± 0.143
0.4CysVal: 0.4 ± 0.106
0.215CysTrp: 0.215 ± 0.089
0.338CysTyr: 0.338 ± 0.108
0.0CysXaa: 0.0 ± 0.0
Asp
5.661AspAla: 5.661 ± 0.379
0.738AspCys: 0.738 ± 0.196
4.154AspAsp: 4.154 ± 0.389
4.0AspGlu: 4.0 ± 0.404
2.585AspPhe: 2.585 ± 0.273
4.769AspGly: 4.769 ± 0.426
0.862AspHis: 0.862 ± 0.178
3.385AspIle: 3.385 ± 0.308
2.492AspLys: 2.492 ± 0.31
4.861AspLeu: 4.861 ± 0.406
2.185AspMet: 2.185 ± 0.259
2.4AspAsn: 2.4 ± 0.266
2.923AspPro: 2.923 ± 0.277
1.723AspGln: 1.723 ± 0.245
3.323AspArg: 3.323 ± 0.322
3.354AspSer: 3.354 ± 0.343
3.569AspThr: 3.569 ± 0.33
5.108AspVal: 5.108 ± 0.399
1.292AspTrp: 1.292 ± 0.22
2.985AspTyr: 2.985 ± 0.366
0.0AspXaa: 0.0 ± 0.0
Glu
4.584GluAla: 4.584 ± 0.375
0.738GluCys: 0.738 ± 0.172
4.277GluAsp: 4.277 ± 0.34
5.261GluGlu: 5.261 ± 0.635
2.708GluPhe: 2.708 ± 0.265
4.646GluGly: 4.646 ± 0.406
1.354GluHis: 1.354 ± 0.213
3.846GluIle: 3.846 ± 0.42
3.292GluLys: 3.292 ± 0.405
5.908GluLeu: 5.908 ± 0.512
2.0GluMet: 2.0 ± 0.243
2.738GluAsn: 2.738 ± 0.304
2.646GluPro: 2.646 ± 0.306
2.031GluGln: 2.031 ± 0.256
3.6GluArg: 3.6 ± 0.355
3.446GluSer: 3.446 ± 0.384
3.292GluThr: 3.292 ± 0.313
3.908GluVal: 3.908 ± 0.343
1.261GluTrp: 1.261 ± 0.204
2.985GluTyr: 2.985 ± 0.356
0.0GluXaa: 0.0 ± 0.0
Phe
2.831PheAla: 2.831 ± 0.276
0.246PheCys: 0.246 ± 0.105
2.369PheAsp: 2.369 ± 0.309
2.615PheGlu: 2.615 ± 0.248
0.862PhePhe: 0.862 ± 0.171
3.231PheGly: 3.231 ± 0.287
0.769PheHis: 0.769 ± 0.148
1.538PheIle: 1.538 ± 0.22
1.938PheLys: 1.938 ± 0.314
2.277PheLeu: 2.277 ± 0.292
1.538PheMet: 1.538 ± 0.262
1.569PheAsn: 1.569 ± 0.221
1.385PhePro: 1.385 ± 0.233
1.2PheGln: 1.2 ± 0.188
1.815PheArg: 1.815 ± 0.238
2.431PheSer: 2.431 ± 0.24
2.0PheThr: 2.0 ± 0.251
2.154PheVal: 2.154 ± 0.261
0.8PheTrp: 0.8 ± 0.166
1.261PheTyr: 1.261 ± 0.229
0.0PheXaa: 0.0 ± 0.0
Gly
5.969GlyAla: 5.969 ± 0.792
0.954GlyCys: 0.954 ± 0.235
5.169GlyAsp: 5.169 ± 0.397
4.308GlyGlu: 4.308 ± 0.39
3.077GlyPhe: 3.077 ± 0.345
6.984GlyGly: 6.984 ± 1.048
1.385GlyHis: 1.385 ± 0.216
4.8GlyIle: 4.8 ± 0.41
4.154GlyLys: 4.154 ± 0.387
5.384GlyLeu: 5.384 ± 0.39
2.277GlyMet: 2.277 ± 0.331
3.569GlyAsn: 3.569 ± 0.338
2.523GlyPro: 2.523 ± 0.282
2.769GlyGln: 2.769 ± 0.284
3.908GlyArg: 3.908 ± 0.366
5.384GlySer: 5.384 ± 0.499
7.231GlyThr: 7.231 ± 0.596
5.538GlyVal: 5.538 ± 0.399
1.631GlyTrp: 1.631 ± 0.216
3.538GlyTyr: 3.538 ± 0.342
0.0GlyXaa: 0.0 ± 0.0
His
1.508HisAla: 1.508 ± 0.227
0.308HisCys: 0.308 ± 0.102
0.985HisAsp: 0.985 ± 0.171
1.169HisGlu: 1.169 ± 0.21
0.523HisPhe: 0.523 ± 0.115
1.446HisGly: 1.446 ± 0.206
0.585HisHis: 0.585 ± 0.163
0.769HisIle: 0.769 ± 0.155
1.046HisLys: 1.046 ± 0.171
1.231HisLeu: 1.231 ± 0.182
0.492HisMet: 0.492 ± 0.117
0.923HisAsn: 0.923 ± 0.176
1.108HisPro: 1.108 ± 0.218
0.523HisGln: 0.523 ± 0.119
0.923HisArg: 0.923 ± 0.182
0.862HisSer: 0.862 ± 0.146
1.446HisThr: 1.446 ± 0.22
1.569HisVal: 1.569 ± 0.232
0.523HisTrp: 0.523 ± 0.12
0.862HisTyr: 0.862 ± 0.17
0.0HisXaa: 0.0 ± 0.0
Ile
3.292IleAla: 3.292 ± 0.398
0.769IleCys: 0.769 ± 0.142
3.169IleAsp: 3.169 ± 0.351
2.892IleGlu: 2.892 ± 0.296
1.015IlePhe: 1.015 ± 0.152
3.631IleGly: 3.631 ± 0.348
1.138IleHis: 1.138 ± 0.183
2.154IleIle: 2.154 ± 0.308
2.0IleLys: 2.0 ± 0.228
3.077IleLeu: 3.077 ± 0.313
1.292IleMet: 1.292 ± 0.196
2.154IleAsn: 2.154 ± 0.32
2.923IlePro: 2.923 ± 0.312
2.0IleGln: 2.0 ± 0.264
2.708IleArg: 2.708 ± 0.28
2.892IleSer: 2.892 ± 0.298
3.261IleThr: 3.261 ± 0.364
4.184IleVal: 4.184 ± 0.357
0.585IleTrp: 0.585 ± 0.149
1.908IleTyr: 1.908 ± 0.278
0.0IleXaa: 0.0 ± 0.0
Lys
3.815LysAla: 3.815 ± 0.384
0.523LysCys: 0.523 ± 0.137
2.615LysAsp: 2.615 ± 0.283
3.6LysGlu: 3.6 ± 0.369
2.061LysPhe: 2.061 ± 0.282
2.831LysGly: 2.831 ± 0.272
0.985LysHis: 0.985 ± 0.168
1.969LysIle: 1.969 ± 0.26
2.861LysLys: 2.861 ± 0.373
3.784LysLeu: 3.784 ± 0.367
1.323LysMet: 1.323 ± 0.205
2.061LysAsn: 2.061 ± 0.283
2.246LysPro: 2.246 ± 0.3
1.477LysGln: 1.477 ± 0.239
2.861LysArg: 2.861 ± 0.296
2.708LysSer: 2.708 ± 0.344
2.738LysThr: 2.738 ± 0.321
3.508LysVal: 3.508 ± 0.34
0.892LysTrp: 0.892 ± 0.166
2.215LysTyr: 2.215 ± 0.291
0.0LysXaa: 0.0 ± 0.0
Leu
6.307LeuAla: 6.307 ± 0.441
0.585LeuCys: 0.585 ± 0.134
5.046LeuAsp: 5.046 ± 0.482
4.892LeuGlu: 4.892 ± 0.381
2.246LeuPhe: 2.246 ± 0.272
6.0LeuGly: 6.0 ± 0.496
1.138LeuHis: 1.138 ± 0.186
3.6LeuIle: 3.6 ± 0.317
3.323LeuLys: 3.323 ± 0.312
5.169LeuLeu: 5.169 ± 0.487
2.0LeuMet: 2.0 ± 0.248
3.692LeuAsn: 3.692 ± 0.358
3.569LeuPro: 3.569 ± 0.354
2.861LeuGln: 2.861 ± 0.291
4.308LeuArg: 4.308 ± 0.387
4.615LeuSer: 4.615 ± 0.34
6.0LeuThr: 6.0 ± 0.519
4.431LeuVal: 4.431 ± 0.405
1.323LeuTrp: 1.323 ± 0.204
2.554LeuTyr: 2.554 ± 0.254
0.0LeuXaa: 0.0 ± 0.0
Met
3.015MetAla: 3.015 ± 0.306
0.092MetCys: 0.092 ± 0.053
1.631MetAsp: 1.631 ± 0.198
1.538MetGlu: 1.538 ± 0.249
1.046MetPhe: 1.046 ± 0.176
2.554MetGly: 2.554 ± 0.353
0.462MetHis: 0.462 ± 0.112
1.046MetIle: 1.046 ± 0.243
1.077MetLys: 1.077 ± 0.216
1.785MetLeu: 1.785 ± 0.295
0.769MetMet: 0.769 ± 0.211
1.323MetAsn: 1.323 ± 0.2
1.169MetPro: 1.169 ± 0.185
1.077MetGln: 1.077 ± 0.191
1.754MetArg: 1.754 ± 0.214
2.277MetSer: 2.277 ± 0.258
2.123MetThr: 2.123 ± 0.212
2.0MetVal: 2.0 ± 0.268
0.369MetTrp: 0.369 ± 0.109
0.646MetTyr: 0.646 ± 0.124
0.0MetXaa: 0.0 ± 0.0
Asn
3.631AsnAla: 3.631 ± 0.36
0.246AsnCys: 0.246 ± 0.089
2.646AsnAsp: 2.646 ± 0.351
2.123AsnGlu: 2.123 ± 0.27
1.877AsnPhe: 1.877 ± 0.219
3.784AsnGly: 3.784 ± 0.409
0.769AsnHis: 0.769 ± 0.127
2.308AsnIle: 2.308 ± 0.279
2.061AsnLys: 2.061 ± 0.253
3.569AsnLeu: 3.569 ± 0.377
1.169AsnMet: 1.169 ± 0.199
2.308AsnAsn: 2.308 ± 0.317
3.077AsnPro: 3.077 ± 0.318
1.754AsnGln: 1.754 ± 0.215
2.769AsnArg: 2.769 ± 0.294
2.892AsnSer: 2.892 ± 0.319
2.861AsnThr: 2.861 ± 0.345
3.692AsnVal: 3.692 ± 0.392
0.8AsnTrp: 0.8 ± 0.146
1.477AsnTyr: 1.477 ± 0.212
0.0AsnXaa: 0.0 ± 0.0
Pro
3.2ProAla: 3.2 ± 0.291
0.4ProCys: 0.4 ± 0.122
3.231ProAsp: 3.231 ± 0.326
3.661ProGlu: 3.661 ± 0.373
1.261ProPhe: 1.261 ± 0.204
4.123ProGly: 4.123 ± 0.386
0.892ProHis: 0.892 ± 0.161
1.815ProIle: 1.815 ± 0.205
1.938ProLys: 1.938 ± 0.294
3.231ProLeu: 3.231 ± 0.34
1.169ProMet: 1.169 ± 0.204
2.431ProAsn: 2.431 ± 0.292
1.6ProPro: 1.6 ± 0.272
1.969ProGln: 1.969 ± 0.264
2.061ProArg: 2.061 ± 0.247
3.754ProSer: 3.754 ± 0.36
3.631ProThr: 3.631 ± 0.4
3.815ProVal: 3.815 ± 0.419
0.554ProTrp: 0.554 ± 0.129
2.061ProTyr: 2.061 ± 0.259
0.0ProXaa: 0.0 ± 0.0
Gln
3.969GlnAla: 3.969 ± 0.441
0.369GlnCys: 0.369 ± 0.114
1.785GlnAsp: 1.785 ± 0.229
2.4GlnGlu: 2.4 ± 0.264
1.508GlnPhe: 1.508 ± 0.234
3.108GlnGly: 3.108 ± 0.31
0.615GlnHis: 0.615 ± 0.148
1.538GlnIle: 1.538 ± 0.248
1.538GlnLys: 1.538 ± 0.216
2.892GlnLeu: 2.892 ± 0.292
0.8GlnMet: 0.8 ± 0.158
1.415GlnAsn: 1.415 ± 0.207
1.292GlnPro: 1.292 ± 0.218
1.908GlnGln: 1.908 ± 0.323
2.338GlnArg: 2.338 ± 0.257
2.092GlnSer: 2.092 ± 0.264
2.246GlnThr: 2.246 ± 0.25
2.4GlnVal: 2.4 ± 0.23
0.985GlnTrp: 0.985 ± 0.178
1.661GlnTyr: 1.661 ± 0.187
0.0GlnXaa: 0.0 ± 0.0
Arg
4.831ArgAla: 4.831 ± 0.327
0.369ArgCys: 0.369 ± 0.106
3.415ArgAsp: 3.415 ± 0.318
3.815ArgGlu: 3.815 ± 0.376
2.061ArgPhe: 2.061 ± 0.259
3.908ArgGly: 3.908 ± 0.363
1.2ArgHis: 1.2 ± 0.195
3.077ArgIle: 3.077 ± 0.303
3.015ArgLys: 3.015 ± 0.33
3.692ArgLeu: 3.692 ± 0.409
1.508ArgMet: 1.508 ± 0.197
2.585ArgAsn: 2.585 ± 0.329
2.185ArgPro: 2.185 ± 0.282
2.185ArgGln: 2.185 ± 0.296
3.354ArgArg: 3.354 ± 0.389
2.8ArgSer: 2.8 ± 0.288
4.0ArgThr: 4.0 ± 0.416
4.861ArgVal: 4.861 ± 0.387
1.046ArgTrp: 1.046 ± 0.185
2.215ArgTyr: 2.215 ± 0.257
0.0ArgXaa: 0.0 ± 0.0
Ser
4.8SerAla: 4.8 ± 0.402
0.431SerCys: 0.431 ± 0.112
3.446SerAsp: 3.446 ± 0.368
3.415SerGlu: 3.415 ± 0.351
2.185SerPhe: 2.185 ± 0.228
5.877SerGly: 5.877 ± 0.484
1.046SerHis: 1.046 ± 0.162
2.677SerIle: 2.677 ± 0.239
3.138SerLys: 3.138 ± 0.305
4.308SerLeu: 4.308 ± 0.357
1.877SerMet: 1.877 ± 0.269
2.646SerAsn: 2.646 ± 0.417
3.015SerPro: 3.015 ± 0.319
2.523SerGln: 2.523 ± 0.256
4.0SerArg: 4.0 ± 0.333
3.631SerSer: 3.631 ± 0.402
4.492SerThr: 4.492 ± 0.416
4.646SerVal: 4.646 ± 0.298
1.015SerTrp: 1.015 ± 0.176
2.0SerTyr: 2.0 ± 0.281
0.0SerXaa: 0.0 ± 0.0
Thr
6.492ThrAla: 6.492 ± 0.572
0.8ThrCys: 0.8 ± 0.184
4.308ThrAsp: 4.308 ± 0.44
4.0ThrGlu: 4.0 ± 0.368
2.585ThrPhe: 2.585 ± 0.337
6.0ThrGly: 6.0 ± 0.442
1.261ThrHis: 1.261 ± 0.244
2.985ThrIle: 2.985 ± 0.389
2.492ThrLys: 2.492 ± 0.32
5.908ThrLeu: 5.908 ± 0.458
1.2ThrMet: 1.2 ± 0.177
3.508ThrAsn: 3.508 ± 0.376
4.123ThrPro: 4.123 ± 0.381
2.123ThrGln: 2.123 ± 0.344
3.323ThrArg: 3.323 ± 0.244
4.246ThrSer: 4.246 ± 0.423
5.569ThrThr: 5.569 ± 0.577
5.938ThrVal: 5.938 ± 0.519
1.354ThrTrp: 1.354 ± 0.215
2.677ThrTyr: 2.677 ± 0.308
0.0ThrXaa: 0.0 ± 0.0
Val
6.338ValAla: 6.338 ± 0.496
0.677ValCys: 0.677 ± 0.143
4.369ValAsp: 4.369 ± 0.37
4.831ValGlu: 4.831 ± 0.421
2.492ValPhe: 2.492 ± 0.247
5.015ValGly: 5.015 ± 0.422
1.385ValHis: 1.385 ± 0.184
3.108ValIle: 3.108 ± 0.309
3.292ValLys: 3.292 ± 0.306
5.415ValLeu: 5.415 ± 0.424
1.785ValMet: 1.785 ± 0.224
3.508ValAsn: 3.508 ± 0.362
3.784ValPro: 3.784 ± 0.419
2.585ValGln: 2.585 ± 0.263
4.461ValArg: 4.461 ± 0.365
4.769ValSer: 4.769 ± 0.316
6.246ValThr: 6.246 ± 0.616
5.292ValVal: 5.292 ± 0.419
1.815ValTrp: 1.815 ± 0.227
2.738ValTyr: 2.738 ± 0.27
0.0ValXaa: 0.0 ± 0.0
Trp
1.692TrpAla: 1.692 ± 0.227
0.215TrpCys: 0.215 ± 0.087
1.477TrpAsp: 1.477 ± 0.218
1.385TrpGlu: 1.385 ± 0.225
0.523TrpPhe: 0.523 ± 0.12
1.2TrpGly: 1.2 ± 0.198
0.554TrpHis: 0.554 ± 0.141
0.585TrpIle: 0.585 ± 0.148
0.892TrpLys: 0.892 ± 0.164
1.477TrpLeu: 1.477 ± 0.222
0.462TrpMet: 0.462 ± 0.103
0.923TrpAsn: 0.923 ± 0.178
0.738TrpPro: 0.738 ± 0.135
0.738TrpGln: 0.738 ± 0.162
0.985TrpArg: 0.985 ± 0.179
1.477TrpSer: 1.477 ± 0.183
1.108TrpThr: 1.108 ± 0.206
1.415TrpVal: 1.415 ± 0.19
0.246TrpTrp: 0.246 ± 0.097
0.8TrpTyr: 0.8 ± 0.167
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.323TyrAla: 3.323 ± 0.288
0.369TyrCys: 0.369 ± 0.103
3.108TyrAsp: 3.108 ± 0.359
2.708TyrGlu: 2.708 ± 0.33
1.2TyrPhe: 1.2 ± 0.201
3.631TyrGly: 3.631 ± 0.374
0.769TyrHis: 0.769 ± 0.17
1.261TyrIle: 1.261 ± 0.225
1.754TyrLys: 1.754 ± 0.224
2.461TyrLeu: 2.461 ± 0.312
1.046TyrMet: 1.046 ± 0.199
2.215TyrAsn: 2.215 ± 0.278
1.385TyrPro: 1.385 ± 0.243
1.354TyrGln: 1.354 ± 0.21
2.831TyrArg: 2.831 ± 0.265
2.154TyrSer: 2.154 ± 0.26
2.554TyrThr: 2.554 ± 0.246
2.861TyrVal: 2.861 ± 0.305
0.892TyrTrp: 0.892 ± 0.15
1.538TyrTyr: 1.538 ± 0.214
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 188 proteins (32502 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski