Amino acid dipepetide frequency for Streptomyces phage Mildred21

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.028AlaAla: 8.028 ± 0.846
0.811AlaCys: 0.811 ± 0.164
5.304AlaAsp: 5.304 ± 0.362
5.42AlaGlu: 5.42 ± 0.468
3.159AlaPhe: 3.159 ± 0.279
5.564AlaGly: 5.564 ± 0.526
1.391AlaHis: 1.391 ± 0.226
4.811AlaIle: 4.811 ± 0.459
5.738AlaLys: 5.738 ± 0.491
6.579AlaLeu: 6.579 ± 0.511
2.608AlaMet: 2.608 ± 0.337
3.449AlaAsn: 3.449 ± 0.429
3.072AlaPro: 3.072 ± 0.478
2.84AlaGln: 2.84 ± 0.376
5.159AlaArg: 5.159 ± 0.47
4.608AlaSer: 4.608 ± 0.655
5.304AlaThr: 5.304 ± 0.72
5.593AlaVal: 5.593 ± 0.353
1.623AlaTrp: 1.623 ± 0.176
3.217AlaTyr: 3.217 ± 0.276
0.0AlaXaa: 0.0 ± 0.0
Cys
0.406CysAla: 0.406 ± 0.116
0.203CysCys: 0.203 ± 0.088
0.696CysAsp: 0.696 ± 0.179
0.725CysGlu: 0.725 ± 0.128
0.29CysPhe: 0.29 ± 0.108
1.159CysGly: 1.159 ± 0.211
0.261CysHis: 0.261 ± 0.088
0.377CysIle: 0.377 ± 0.103
0.985CysLys: 0.985 ± 0.2
0.667CysLeu: 0.667 ± 0.156
0.29CysMet: 0.29 ± 0.092
0.725CysAsn: 0.725 ± 0.152
0.522CysPro: 0.522 ± 0.127
0.348CysGln: 0.348 ± 0.105
0.667CysArg: 0.667 ± 0.156
0.609CysSer: 0.609 ± 0.163
0.464CysThr: 0.464 ± 0.114
0.754CysVal: 0.754 ± 0.145
0.232CysTrp: 0.232 ± 0.078
0.522CysTyr: 0.522 ± 0.157
0.0CysXaa: 0.0 ± 0.0
Asp
5.796AspAla: 5.796 ± 0.459
0.667AspCys: 0.667 ± 0.153
4.173AspAsp: 4.173 ± 0.468
5.506AspGlu: 5.506 ± 0.431
3.42AspPhe: 3.42 ± 0.32
5.391AspGly: 5.391 ± 0.448
1.13AspHis: 1.13 ± 0.2
3.883AspIle: 3.883 ± 0.394
3.739AspLys: 3.739 ± 0.406
4.666AspLeu: 4.666 ± 0.379
2.029AspMet: 2.029 ± 0.259
2.927AspAsn: 2.927 ± 0.275
2.319AspPro: 2.319 ± 0.292
1.42AspGln: 1.42 ± 0.167
2.898AspArg: 2.898 ± 0.279
3.912AspSer: 3.912 ± 0.315
3.246AspThr: 3.246 ± 0.385
4.579AspVal: 4.579 ± 0.432
1.71AspTrp: 1.71 ± 0.267
2.492AspTyr: 2.492 ± 0.334
0.0AspXaa: 0.0 ± 0.0
Glu
6.666GluAla: 6.666 ± 0.584
0.58GluCys: 0.58 ± 0.133
4.579GluAsp: 4.579 ± 0.392
5.217GluGlu: 5.217 ± 0.528
3.188GluPhe: 3.188 ± 0.293
4.724GluGly: 4.724 ± 0.335
1.449GluHis: 1.449 ± 0.233
4.202GluIle: 4.202 ± 0.389
4.724GluLys: 4.724 ± 0.438
5.188GluLeu: 5.188 ± 0.485
2.087GluMet: 2.087 ± 0.243
3.13GluAsn: 3.13 ± 0.391
2.29GluPro: 2.29 ± 0.331
2.521GluGln: 2.521 ± 0.326
4.376GluArg: 4.376 ± 0.413
3.304GluSer: 3.304 ± 0.362
3.333GluThr: 3.333 ± 0.352
5.159GluVal: 5.159 ± 0.445
1.449GluTrp: 1.449 ± 0.198
2.84GluTyr: 2.84 ± 0.419
0.0GluXaa: 0.0 ± 0.0
Phe
2.724PheAla: 2.724 ± 0.259
0.377PheCys: 0.377 ± 0.112
3.507PheAsp: 3.507 ± 0.329
3.739PheGlu: 3.739 ± 0.338
1.507PhePhe: 1.507 ± 0.213
3.043PheGly: 3.043 ± 0.272
0.754PheHis: 0.754 ± 0.199
1.652PheIle: 1.652 ± 0.201
2.116PheLys: 2.116 ± 0.235
2.174PheLeu: 2.174 ± 0.276
1.101PheMet: 1.101 ± 0.196
2.145PheAsn: 2.145 ± 0.21
1.072PhePro: 1.072 ± 0.173
1.014PheGln: 1.014 ± 0.172
2.0PheArg: 2.0 ± 0.222
2.782PheSer: 2.782 ± 0.33
2.405PheThr: 2.405 ± 0.273
2.724PheVal: 2.724 ± 0.323
0.551PheTrp: 0.551 ± 0.126
1.217PheTyr: 1.217 ± 0.15
0.0PheXaa: 0.0 ± 0.0
Gly
5.304GlyAla: 5.304 ± 0.368
0.609GlyCys: 0.609 ± 0.13
4.463GlyAsp: 4.463 ± 0.337
4.086GlyGlu: 4.086 ± 0.328
2.927GlyPhe: 2.927 ± 0.269
5.014GlyGly: 5.014 ± 0.539
1.826GlyHis: 1.826 ± 0.237
4.666GlyIle: 4.666 ± 0.407
5.014GlyLys: 5.014 ± 0.456
5.072GlyLeu: 5.072 ± 0.402
2.55GlyMet: 2.55 ± 0.248
3.449GlyAsn: 3.449 ± 0.306
2.203GlyPro: 2.203 ± 0.333
2.521GlyGln: 2.521 ± 0.304
3.999GlyArg: 3.999 ± 0.359
4.405GlySer: 4.405 ± 0.543
5.333GlyThr: 5.333 ± 0.82
5.999GlyVal: 5.999 ± 0.439
1.623GlyTrp: 1.623 ± 0.227
2.985GlyTyr: 2.985 ± 0.32
0.0GlyXaa: 0.0 ± 0.0
His
1.072HisAla: 1.072 ± 0.173
0.348HisCys: 0.348 ± 0.095
1.217HisAsp: 1.217 ± 0.235
1.101HisGlu: 1.101 ± 0.176
0.898HisPhe: 0.898 ± 0.173
1.681HisGly: 1.681 ± 0.243
0.464HisHis: 0.464 ± 0.117
1.275HisIle: 1.275 ± 0.214
1.072HisLys: 1.072 ± 0.166
1.42HisLeu: 1.42 ± 0.188
0.58HisMet: 0.58 ± 0.128
0.696HisAsn: 0.696 ± 0.162
0.84HisPro: 0.84 ± 0.165
0.551HisGln: 0.551 ± 0.127
1.623HisArg: 1.623 ± 0.259
1.101HisSer: 1.101 ± 0.162
0.869HisThr: 0.869 ± 0.163
1.304HisVal: 1.304 ± 0.211
0.319HisTrp: 0.319 ± 0.096
0.927HisTyr: 0.927 ± 0.17
0.0HisXaa: 0.0 ± 0.0
Ile
4.724IleAla: 4.724 ± 0.404
0.869IleCys: 0.869 ± 0.183
4.434IleAsp: 4.434 ± 0.345
4.463IleGlu: 4.463 ± 0.43
1.391IlePhe: 1.391 ± 0.2
3.826IleGly: 3.826 ± 0.368
0.956IleHis: 0.956 ± 0.197
2.087IleIle: 2.087 ± 0.241
2.985IleLys: 2.985 ± 0.299
3.71IleLeu: 3.71 ± 0.39
1.159IleMet: 1.159 ± 0.16
2.0IleAsn: 2.0 ± 0.278
2.463IlePro: 2.463 ± 0.28
1.652IleGln: 1.652 ± 0.356
3.449IleArg: 3.449 ± 0.297
3.101IleSer: 3.101 ± 0.329
3.072IleThr: 3.072 ± 0.27
4.434IleVal: 4.434 ± 0.348
0.754IleTrp: 0.754 ± 0.157
1.797IleTyr: 1.797 ± 0.293
0.0IleXaa: 0.0 ± 0.0
Lys
6.028LysAla: 6.028 ± 0.492
0.956LysCys: 0.956 ± 0.239
4.144LysAsp: 4.144 ± 0.419
4.26LysGlu: 4.26 ± 0.416
2.058LysPhe: 2.058 ± 0.278
4.405LysGly: 4.405 ± 0.411
1.333LysHis: 1.333 ± 0.228
2.985LysIle: 2.985 ± 0.376
4.434LysLys: 4.434 ± 0.412
4.521LysLeu: 4.521 ± 0.439
2.29LysMet: 2.29 ± 0.23
3.42LysAsn: 3.42 ± 0.344
2.782LysPro: 2.782 ± 0.296
2.087LysGln: 2.087 ± 0.269
3.855LysArg: 3.855 ± 0.408
3.304LysSer: 3.304 ± 0.329
3.912LysThr: 3.912 ± 0.391
3.855LysVal: 3.855 ± 0.3
1.13LysTrp: 1.13 ± 0.19
2.463LysTyr: 2.463 ± 0.31
0.0LysXaa: 0.0 ± 0.0
Leu
7.274LeuAla: 7.274 ± 0.582
0.898LeuCys: 0.898 ± 0.176
4.724LeuAsp: 4.724 ± 0.365
5.42LeuGlu: 5.42 ± 0.436
2.203LeuPhe: 2.203 ± 0.29
4.695LeuGly: 4.695 ± 0.38
1.246LeuHis: 1.246 ± 0.183
4.463LeuIle: 4.463 ± 0.323
4.202LeuLys: 4.202 ± 0.346
4.753LeuLeu: 4.753 ± 0.406
1.71LeuMet: 1.71 ± 0.274
3.13LeuAsn: 3.13 ± 0.301
2.753LeuPro: 2.753 ± 0.254
1.942LeuGln: 1.942 ± 0.271
4.347LeuArg: 4.347 ± 0.336
4.956LeuSer: 4.956 ± 0.451
4.724LeuThr: 4.724 ± 0.399
4.347LeuVal: 4.347 ± 0.495
1.333LeuTrp: 1.333 ± 0.188
2.463LeuTyr: 2.463 ± 0.277
0.0LeuXaa: 0.0 ± 0.0
Met
2.782MetAla: 2.782 ± 0.317
0.29MetCys: 0.29 ± 0.091
1.565MetAsp: 1.565 ± 0.2
1.333MetGlu: 1.333 ± 0.2
0.811MetPhe: 0.811 ± 0.174
2.0MetGly: 2.0 ± 0.28
0.551MetHis: 0.551 ± 0.141
1.275MetIle: 1.275 ± 0.193
1.855MetLys: 1.855 ± 0.211
1.942MetLeu: 1.942 ± 0.222
0.609MetMet: 0.609 ± 0.137
1.159MetAsn: 1.159 ± 0.207
1.246MetPro: 1.246 ± 0.216
1.072MetGln: 1.072 ± 0.264
2.174MetArg: 2.174 ± 0.221
2.145MetSer: 2.145 ± 0.303
2.434MetThr: 2.434 ± 0.272
1.855MetVal: 1.855 ± 0.256
0.406MetTrp: 0.406 ± 0.117
0.725MetTyr: 0.725 ± 0.143
0.0MetXaa: 0.0 ± 0.0
Asn
3.652AsnAla: 3.652 ± 0.345
0.29AsnCys: 0.29 ± 0.092
2.811AsnAsp: 2.811 ± 0.269
2.898AsnGlu: 2.898 ± 0.332
1.623AsnPhe: 1.623 ± 0.188
3.999AsnGly: 3.999 ± 0.375
1.072AsnHis: 1.072 ± 0.186
2.232AsnIle: 2.232 ± 0.222
3.275AsnLys: 3.275 ± 0.373
3.71AsnLeu: 3.71 ± 0.353
1.014AsnMet: 1.014 ± 0.174
2.087AsnAsn: 2.087 ± 0.272
1.913AsnPro: 1.913 ± 0.266
1.304AsnGln: 1.304 ± 0.216
1.971AsnArg: 1.971 ± 0.25
2.347AsnSer: 2.347 ± 0.339
2.985AsnThr: 2.985 ± 0.411
2.927AsnVal: 2.927 ± 0.324
0.754AsnTrp: 0.754 ± 0.149
1.391AsnTyr: 1.391 ± 0.206
0.0AsnXaa: 0.0 ± 0.0
Pro
2.985ProAla: 2.985 ± 0.334
0.406ProCys: 0.406 ± 0.11
2.753ProAsp: 2.753 ± 0.352
3.043ProGlu: 3.043 ± 0.355
1.536ProPhe: 1.536 ± 0.242
3.391ProGly: 3.391 ± 0.329
0.58ProHis: 0.58 ± 0.13
1.623ProIle: 1.623 ± 0.191
2.492ProLys: 2.492 ± 0.351
2.261ProLeu: 2.261 ± 0.246
0.956ProMet: 0.956 ± 0.179
1.71ProAsn: 1.71 ± 0.266
1.362ProPro: 1.362 ± 0.274
0.927ProGln: 0.927 ± 0.17
2.029ProArg: 2.029 ± 0.271
2.087ProSer: 2.087 ± 0.424
2.232ProThr: 2.232 ± 0.322
3.739ProVal: 3.739 ± 0.312
0.464ProTrp: 0.464 ± 0.123
1.101ProTyr: 1.101 ± 0.187
0.0ProXaa: 0.0 ± 0.0
Gln
2.956GlnAla: 2.956 ± 0.428
0.493GlnCys: 0.493 ± 0.136
1.304GlnAsp: 1.304 ± 0.18
2.666GlnGlu: 2.666 ± 0.35
1.391GlnPhe: 1.391 ± 0.238
1.855GlnGly: 1.855 ± 0.207
0.522GlnHis: 0.522 ± 0.111
1.565GlnIle: 1.565 ± 0.23
2.174GlnLys: 2.174 ± 0.327
2.753GlnLeu: 2.753 ± 0.269
1.043GlnMet: 1.043 ± 0.175
1.362GlnAsn: 1.362 ± 0.209
0.869GlnPro: 0.869 ± 0.135
0.956GlnGln: 0.956 ± 0.316
2.058GlnArg: 2.058 ± 0.354
1.71GlnSer: 1.71 ± 0.176
1.71GlnThr: 1.71 ± 0.267
2.174GlnVal: 2.174 ± 0.244
0.464GlnTrp: 0.464 ± 0.109
1.246GlnTyr: 1.246 ± 0.173
0.0GlnXaa: 0.0 ± 0.0
Arg
4.521ArgAla: 4.521 ± 0.454
0.58ArgCys: 0.58 ± 0.13
3.391ArgAsp: 3.391 ± 0.337
3.797ArgGlu: 3.797 ± 0.358
2.637ArgPhe: 2.637 ± 0.35
4.231ArgGly: 4.231 ± 0.302
1.188ArgHis: 1.188 ± 0.211
2.956ArgIle: 2.956 ± 0.328
4.347ArgLys: 4.347 ± 0.51
4.086ArgLeu: 4.086 ± 0.352
1.942ArgMet: 1.942 ± 0.279
2.84ArgAsn: 2.84 ± 0.238
1.884ArgPro: 1.884 ± 0.231
2.0ArgGln: 2.0 ± 0.25
3.623ArgArg: 3.623 ± 0.417
2.956ArgSer: 2.956 ± 0.283
2.985ArgThr: 2.985 ± 0.29
4.144ArgVal: 4.144 ± 0.397
1.246ArgTrp: 1.246 ± 0.166
2.492ArgTyr: 2.492 ± 0.292
0.0ArgXaa: 0.0 ± 0.0
Ser
4.637SerAla: 4.637 ± 0.555
0.551SerCys: 0.551 ± 0.127
3.855SerAsp: 3.855 ± 0.363
3.739SerGlu: 3.739 ± 0.366
2.347SerPhe: 2.347 ± 0.282
5.738SerGly: 5.738 ± 0.539
1.304SerHis: 1.304 ± 0.171
3.188SerIle: 3.188 ± 0.356
3.594SerLys: 3.594 ± 0.36
4.724SerLeu: 4.724 ± 0.367
1.768SerMet: 1.768 ± 0.241
2.55SerAsn: 2.55 ± 0.284
2.145SerPro: 2.145 ± 0.333
1.797SerGln: 1.797 ± 0.214
2.898SerArg: 2.898 ± 0.289
3.42SerSer: 3.42 ± 0.521
3.652SerThr: 3.652 ± 0.731
4.231SerVal: 4.231 ± 0.414
1.159SerTrp: 1.159 ± 0.204
1.942SerTyr: 1.942 ± 0.296
0.0SerXaa: 0.0 ± 0.0
Thr
5.043ThrAla: 5.043 ± 0.587
0.725ThrCys: 0.725 ± 0.13
3.999ThrAsp: 3.999 ± 0.321
4.289ThrGlu: 4.289 ± 0.421
2.608ThrPhe: 2.608 ± 0.283
4.84ThrGly: 4.84 ± 0.887
0.985ThrHis: 0.985 ± 0.173
3.883ThrIle: 3.883 ± 0.414
3.536ThrLys: 3.536 ± 0.347
3.768ThrLeu: 3.768 ± 0.338
1.159ThrMet: 1.159 ± 0.228
2.376ThrAsn: 2.376 ± 0.545
2.985ThrPro: 2.985 ± 0.471
1.884ThrGln: 1.884 ± 0.262
2.782ThrArg: 2.782 ± 0.317
3.797ThrSer: 3.797 ± 0.363
3.797ThrThr: 3.797 ± 0.882
4.84ThrVal: 4.84 ± 0.603
1.304ThrTrp: 1.304 ± 0.182
2.029ThrTyr: 2.029 ± 0.231
0.0ThrXaa: 0.0 ± 0.0
Val
5.101ValAla: 5.101 ± 0.359
0.754ValCys: 0.754 ± 0.135
5.101ValAsp: 5.101 ± 0.504
5.043ValGlu: 5.043 ± 0.444
2.666ValPhe: 2.666 ± 0.277
3.912ValGly: 3.912 ± 0.298
1.42ValHis: 1.42 ± 0.22
4.115ValIle: 4.115 ± 0.363
4.608ValLys: 4.608 ± 0.397
4.753ValLeu: 4.753 ± 0.388
1.826ValMet: 1.826 ± 0.181
2.521ValAsn: 2.521 ± 0.334
2.956ValPro: 2.956 ± 0.331
2.55ValGln: 2.55 ± 0.272
4.956ValArg: 4.956 ± 0.29
5.188ValSer: 5.188 ± 0.398
4.318ValThr: 4.318 ± 0.613
5.275ValVal: 5.275 ± 0.426
1.246ValTrp: 1.246 ± 0.212
3.275ValTyr: 3.275 ± 0.389
0.0ValXaa: 0.0 ± 0.0
Trp
1.623TrpAla: 1.623 ± 0.213
0.174TrpCys: 0.174 ± 0.073
1.159TrpAsp: 1.159 ± 0.182
1.652TrpGlu: 1.652 ± 0.242
0.667TrpPhe: 0.667 ± 0.125
1.362TrpGly: 1.362 ± 0.186
0.406TrpHis: 0.406 ± 0.118
0.696TrpIle: 0.696 ± 0.143
1.391TrpLys: 1.391 ± 0.235
1.71TrpLeu: 1.71 ± 0.289
0.638TrpMet: 0.638 ± 0.167
1.072TrpAsn: 1.072 ± 0.178
0.551TrpPro: 0.551 ± 0.134
0.638TrpGln: 0.638 ± 0.138
0.869TrpArg: 0.869 ± 0.158
0.956TrpSer: 0.956 ± 0.165
1.188TrpThr: 1.188 ± 0.241
0.869TrpVal: 0.869 ± 0.169
0.493TrpTrp: 0.493 ± 0.131
0.782TrpTyr: 0.782 ± 0.141
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.985TyrAla: 2.985 ± 0.304
0.319TyrCys: 0.319 ± 0.109
2.724TyrAsp: 2.724 ± 0.383
2.608TyrGlu: 2.608 ± 0.334
1.275TyrPhe: 1.275 ± 0.208
3.217TyrGly: 3.217 ± 0.324
0.522TyrHis: 0.522 ± 0.138
1.391TyrIle: 1.391 ± 0.151
1.942TyrLys: 1.942 ± 0.257
3.13TyrLeu: 3.13 ± 0.302
0.84TyrMet: 0.84 ± 0.168
1.507TyrAsn: 1.507 ± 0.208
1.449TyrPro: 1.449 ± 0.267
1.246TyrGln: 1.246 ± 0.19
2.116TyrArg: 2.116 ± 0.213
2.55TyrSer: 2.55 ± 0.287
2.666TyrThr: 2.666 ± 0.273
2.782TyrVal: 2.782 ± 0.398
0.638TyrTrp: 0.638 ± 0.158
1.246TyrTyr: 1.246 ± 0.213
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 212 proteins (34506 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski