Amino acid dipeptide frequency for Streptomyces phage Moab

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
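The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the code used to build this page: the function name `dipeptide_permille`, the toy sequences, the use of one-letter codes, and the choice to count overlapping adjacent pairs within each protein and normalize by the number of pairs are all assumptions.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes
ALPHABET = STANDARD + "X"          # X stands for Xaa (non-standard or unknown)


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs inside one protein; pairs never span
        # two proteins (an assumption of this sketch).
        for a, b in zip(seq, seq[1:]):
            a = a if a in STANDARD else "X"
            b = b if b in STANDARD else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total) if total else 0.0
        for a, b in product(ALPHABET, repeat=2)
    }


# Expected value if all 400 standard dipeptides were equally likely.
UNIFORM_PERMILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25 %

toy = ["MAAGK", "AAL"]  # hypothetical sequences, for illustration only
freqs = dipeptide_permille(toy)
over = sorted(p for p, f in freqs.items() if f > UNIFORM_PERMILLE)
```

In the toy input, six pairs are counted in total and `AA` occurs twice, so it is far above the uniform expectation; every pair that never occurs falls below it.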

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Entries are grouped by the first residue; within each group the second residue follows the column order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa. Each entry reads dipeptide: frequency ± error (‰).
Ala
AlaAla: 7.831 ± 1.155
AlaCys: 0.594 ± 0.152
AlaAsp: 4.078 ± 0.296
AlaGlu: 5.914 ± 0.553
AlaPhe: 3.376 ± 0.297
AlaGly: 6.805 ± 0.64
AlaHis: 1.485 ± 0.196
AlaIle: 3.808 ± 0.353
AlaLys: 4.591 ± 0.487
AlaLeu: 5.941 ± 0.486
AlaMet: 2.43 ± 0.35
AlaAsn: 3.781 ± 0.408
AlaPro: 2.646 ± 0.305
AlaGln: 2.862 ± 0.444
AlaArg: 4.213 ± 0.478
AlaSer: 4.753 ± 0.539
AlaThr: 5.509 ± 0.575
AlaVal: 5.779 ± 0.38
AlaTrp: 1.458 ± 0.238
AlaTyr: 2.619 ± 0.269
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.621 ± 0.136
CysCys: 0.297 ± 0.105
CysAsp: 0.81 ± 0.166
CysGlu: 0.621 ± 0.148
CysPhe: 0.351 ± 0.115
CysGly: 1.269 ± 0.243
CysHis: 0.27 ± 0.091
CysIle: 0.405 ± 0.112
CysLys: 0.783 ± 0.171
CysLeu: 0.729 ± 0.139
CysMet: 0.27 ± 0.095
CysAsn: 0.594 ± 0.155
CysPro: 0.621 ± 0.154
CysGln: 0.351 ± 0.097
CysArg: 0.513 ± 0.123
CysSer: 0.729 ± 0.156
CysThr: 0.378 ± 0.1
CysVal: 0.621 ± 0.141
CysTrp: 0.135 ± 0.064
CysTyr: 0.432 ± 0.124
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.239 ± 0.442
AspCys: 0.972 ± 0.18
AspAsp: 3.862 ± 0.4
AspGlu: 5.482 ± 0.494
AspPhe: 2.916 ± 0.394
AspGly: 5.698 ± 0.459
AspHis: 1.053 ± 0.202
AspIle: 3.051 ± 0.278
AspLys: 3.349 ± 0.304
AspLeu: 4.564 ± 0.355
AspMet: 2.025 ± 0.257
AspAsn: 3.133 ± 0.329
AspPro: 2.133 ± 0.27
AspGln: 1.62 ± 0.187
AspArg: 2.889 ± 0.342
AspSer: 3.943 ± 0.343
AspThr: 3.916 ± 0.439
AspVal: 4.861 ± 0.508
AspTrp: 1.215 ± 0.145
AspTyr: 2.916 ± 0.258
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.05 ± 0.509
GluCys: 0.54 ± 0.16
GluAsp: 4.348 ± 0.4
GluGlu: 5.239 ± 0.63
GluPhe: 3.024 ± 0.323
GluGly: 3.673 ± 0.311
GluHis: 1.377 ± 0.25
GluIle: 4.186 ± 0.472
GluLys: 4.645 ± 0.399
GluLeu: 5.104 ± 0.472
GluMet: 2.295 ± 0.293
GluAsn: 2.862 ± 0.311
GluPro: 1.701 ± 0.237
GluGln: 2.457 ± 0.247
GluArg: 3.835 ± 0.324
GluSer: 3.862 ± 0.369
GluThr: 3.592 ± 0.337
GluVal: 4.861 ± 0.363
GluTrp: 1.404 ± 0.201
GluTyr: 3.511 ± 0.392
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.133 ± 0.344
PheCys: 0.432 ± 0.1
PheAsp: 3.268 ± 0.294
PheGlu: 2.403 ± 0.275
PhePhe: 1.431 ± 0.193
PheGly: 2.889 ± 0.297
PheHis: 0.891 ± 0.175
PheIle: 1.647 ± 0.211
PheLys: 2.349 ± 0.271
PheLeu: 2.862 ± 0.376
PheMet: 0.81 ± 0.156
PheAsn: 1.944 ± 0.242
PhePro: 1.107 ± 0.186
PheGln: 0.972 ± 0.177
PheArg: 1.863 ± 0.208
PheSer: 3.133 ± 0.329
PheThr: 2.079 ± 0.206
PheVal: 3.024 ± 0.273
PheTrp: 0.675 ± 0.121
PheTyr: 1.755 ± 0.229
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.455 ± 0.56
GlyCys: 0.675 ± 0.146
GlyAsp: 5.023 ± 0.384
GlyGlu: 4.105 ± 0.379
GlyPhe: 3.457 ± 0.279
GlyGly: 6.049 ± 0.513
GlyHis: 1.593 ± 0.267
GlyIle: 4.915 ± 0.381
GlyLys: 4.483 ± 0.33
GlyLeu: 5.752 ± 0.46
GlyMet: 2.862 ± 0.209
GlyAsn: 4.24 ± 0.376
GlyPro: 2.7 ± 0.359
GlyGln: 2.457 ± 0.29
GlyArg: 4.375 ± 0.378
GlySer: 5.293 ± 0.483
GlyThr: 5.212 ± 0.727
GlyVal: 5.725 ± 0.504
GlyTrp: 2.16 ± 0.279
GlyTyr: 3.214 ± 0.287
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.296 ± 0.154
HisCys: 0.216 ± 0.082
HisAsp: 1.296 ± 0.215
HisGlu: 1.161 ± 0.232
HisPhe: 0.783 ± 0.179
HisGly: 1.593 ± 0.211
HisHis: 0.297 ± 0.087
HisIle: 0.81 ± 0.137
HisLys: 1.134 ± 0.193
HisLeu: 1.215 ± 0.207
HisMet: 0.459 ± 0.125
HisAsn: 0.999 ± 0.166
HisPro: 0.756 ± 0.131
HisGln: 0.486 ± 0.132
HisArg: 1.377 ± 0.219
HisSer: 0.918 ± 0.148
HisThr: 0.891 ± 0.157
HisVal: 1.512 ± 0.241
HisTrp: 0.351 ± 0.086
HisTyr: 0.891 ± 0.186
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.348 ± 0.394
IleCys: 0.702 ± 0.145
IleAsp: 4.132 ± 0.434
IleGlu: 4.294 ± 0.37
IlePhe: 1.188 ± 0.211
IleGly: 3.754 ± 0.304
IleHis: 0.891 ± 0.191
IleIle: 2.511 ± 0.287
IleLys: 3.187 ± 0.291
IleLeu: 3.997 ± 0.39
IleMet: 1.431 ± 0.152
IleAsn: 2.16 ± 0.244
IlePro: 2.403 ± 0.302
IleGln: 2.16 ± 0.271
IleArg: 3.457 ± 0.37
IleSer: 3.16 ± 0.276
IleThr: 3.079 ± 0.353
IleVal: 3.835 ± 0.344
IleTrp: 0.81 ± 0.145
IleTyr: 1.917 ± 0.251
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.266 ± 0.456
LysCys: 0.756 ± 0.156
LysAsp: 3.241 ± 0.348
LysGlu: 3.349 ± 0.386
LysPhe: 2.268 ± 0.257
LysGly: 4.591 ± 0.396
LysHis: 0.999 ± 0.196
LysIle: 3.511 ± 0.291
LysLys: 4.105 ± 0.476
LysLeu: 3.457 ± 0.321
LysMet: 1.917 ± 0.247
LysAsn: 3.16 ± 0.36
LysPro: 2.538 ± 0.245
LysGln: 1.971 ± 0.208
LysArg: 4.078 ± 0.385
LysSer: 3.349 ± 0.341
LysThr: 3.997 ± 0.353
LysVal: 4.672 ± 0.414
LysTrp: 1.215 ± 0.2
LysTyr: 2.268 ± 0.264
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.671 ± 0.46
LeuCys: 0.729 ± 0.165
LeuAsp: 5.644 ± 0.443
LeuGlu: 5.023 ± 0.364
LeuPhe: 2.484 ± 0.298
LeuGly: 5.59 ± 0.406
LeuHis: 1.269 ± 0.198
LeuIle: 3.7 ± 0.388
LeuLys: 4.699 ± 0.399
LeuLeu: 4.294 ± 0.34
LeuMet: 1.566 ± 0.216
LeuAsn: 3.187 ± 0.325
LeuPro: 2.322 ± 0.247
LeuGln: 2.538 ± 0.273
LeuArg: 3.322 ± 0.327
LeuSer: 4.456 ± 0.359
LeuThr: 4.429 ± 0.384
LeuVal: 4.483 ± 0.345
LeuTrp: 1.296 ± 0.218
LeuTyr: 2.619 ± 0.265
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.484 ± 0.267
MetCys: 0.297 ± 0.092
MetAsp: 1.647 ± 0.217
MetGlu: 1.701 ± 0.225
MetPhe: 0.702 ± 0.17
MetGly: 1.755 ± 0.192
MetHis: 0.648 ± 0.142
MetIle: 1.458 ± 0.222
MetLys: 1.89 ± 0.253
MetLeu: 1.89 ± 0.221
MetMet: 0.729 ± 0.134
MetAsn: 1.701 ± 0.256
MetPro: 1.188 ± 0.192
MetGln: 0.891 ± 0.193
MetArg: 1.62 ± 0.191
MetSer: 1.998 ± 0.219
MetThr: 2.052 ± 0.223
MetVal: 1.917 ± 0.241
MetTrp: 0.351 ± 0.088
MetTyr: 1.107 ± 0.182
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.943 ± 0.406
AsnCys: 0.378 ± 0.114
AsnAsp: 2.835 ± 0.254
AsnGlu: 3.187 ± 0.321
AsnPhe: 1.647 ± 0.219
AsnGly: 4.807 ± 0.435
AsnHis: 0.729 ± 0.138
AsnIle: 2.511 ± 0.258
AsnLys: 2.484 ± 0.324
AsnLeu: 3.484 ± 0.381
AsnMet: 1.161 ± 0.17
AsnAsn: 1.863 ± 0.307
AsnPro: 2.106 ± 0.247
AsnGln: 1.431 ± 0.237
AsnArg: 2.511 ± 0.249
AsnSer: 2.511 ± 0.32
AsnThr: 2.862 ± 0.495
AsnVal: 3.943 ± 0.357
AsnTrp: 0.675 ± 0.135
AsnTyr: 2.025 ± 0.235
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.7 ± 0.258
ProCys: 0.27 ± 0.084
ProAsp: 2.889 ± 0.329
ProGlu: 2.781 ± 0.333
ProPhe: 1.377 ± 0.214
ProGly: 2.808 ± 0.277
ProHis: 0.702 ± 0.12
ProIle: 1.674 ± 0.211
ProLys: 2.133 ± 0.289
ProLeu: 2.133 ± 0.215
ProMet: 0.54 ± 0.099
ProAsn: 1.998 ± 0.265
ProPro: 1.215 ± 0.251
ProGln: 1.215 ± 0.198
ProArg: 1.998 ± 0.26
ProSer: 2.403 ± 0.408
ProThr: 2.484 ± 0.3
ProVal: 3.051 ± 0.265
ProTrp: 0.621 ± 0.139
ProTyr: 1.242 ± 0.191
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.565 ± 0.383
GlnCys: 0.324 ± 0.086
GlnAsp: 1.458 ± 0.212
GlnGlu: 2.16 ± 0.238
GlnPhe: 1.296 ± 0.196
GlnGly: 2.727 ± 0.391
GlnHis: 0.459 ± 0.129
GlnIle: 1.971 ± 0.262
GlnLys: 2.322 ± 0.31
GlnLeu: 2.511 ± 0.34
GlnMet: 0.918 ± 0.159
GlnAsn: 1.431 ± 0.191
GlnPro: 1.053 ± 0.178
GlnGln: 1.053 ± 0.225
GlnArg: 1.674 ± 0.23
GlnSer: 2.052 ± 0.268
GlnThr: 1.431 ± 0.225
GlnVal: 2.295 ± 0.28
GlnTrp: 0.702 ± 0.127
GlnTyr: 1.242 ± 0.148
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.24 ± 0.44
ArgCys: 0.486 ± 0.117
ArgAsp: 3.079 ± 0.35
ArgGlu: 3.484 ± 0.323
ArgPhe: 2.376 ± 0.233
ArgGly: 3.808 ± 0.345
ArgHis: 0.999 ± 0.158
ArgIle: 3.241 ± 0.301
ArgLys: 4.375 ± 0.434
ArgLeu: 3.727 ± 0.323
ArgMet: 1.755 ± 0.21
ArgAsn: 2.592 ± 0.242
ArgPro: 2.403 ± 0.269
ArgGln: 1.62 ± 0.239
ArgArg: 3.835 ± 0.465
ArgSer: 2.646 ± 0.274
ArgThr: 2.673 ± 0.336
ArgVal: 4.051 ± 0.357
ArgTrp: 0.972 ± 0.183
ArgTyr: 2.484 ± 0.284
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.104 ± 0.639
SerCys: 0.729 ± 0.148
SerAsp: 3.403 ± 0.329
SerGlu: 3.511 ± 0.343
SerPhe: 2.538 ± 0.282
SerGly: 5.671 ± 0.774
SerHis: 0.999 ± 0.167
SerIle: 3.511 ± 0.313
SerLys: 3.565 ± 0.345
SerLeu: 4.078 ± 0.433
SerMet: 1.971 ± 0.263
SerAsn: 2.727 ± 0.326
SerPro: 2.268 ± 0.34
SerGln: 2.052 ± 0.31
SerArg: 3.16 ± 0.291
SerSer: 4.132 ± 0.471
SerThr: 4.294 ± 0.461
SerVal: 4.375 ± 0.384
SerTrp: 1.539 ± 0.208
SerTyr: 2.106 ± 0.238
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.401 ± 0.823
ThrCys: 0.513 ± 0.115
ThrAsp: 4.267 ± 0.399
ThrGlu: 4.186 ± 0.434
ThrPhe: 2.592 ± 0.278
ThrGly: 5.914 ± 0.815
ThrHis: 0.999 ± 0.194
ThrIle: 3.808 ± 0.376
ThrLys: 2.7 ± 0.259
ThrLeu: 4.726 ± 0.394
ThrMet: 1.242 ± 0.186
ThrAsn: 2.754 ± 0.342
ThrPro: 2.565 ± 0.345
ThrGln: 1.728 ± 0.272
ThrArg: 2.943 ± 0.26
ThrSer: 3.538 ± 0.417
ThrThr: 4.321 ± 0.742
ThrVal: 5.374 ± 0.422
ThrTrp: 1.107 ± 0.161
ThrTyr: 2.106 ± 0.299
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.671 ± 0.322
ValCys: 0.999 ± 0.203
ValAsp: 5.428 ± 0.386
ValGlu: 4.645 ± 0.437
ValPhe: 2.943 ± 0.268
ValGly: 5.293 ± 0.43
ValHis: 1.512 ± 0.203
ValIle: 3.7 ± 0.354
ValLys: 4.483 ± 0.373
ValLeu: 4.699 ± 0.365
ValMet: 2.214 ± 0.259
ValAsn: 2.781 ± 0.259
ValPro: 2.511 ± 0.344
ValGln: 1.836 ± 0.209
ValArg: 4.159 ± 0.389
ValSer: 5.158 ± 0.35
ValThr: 5.536 ± 0.58
ValVal: 5.266 ± 0.514
ValTrp: 1.35 ± 0.219
ValTyr: 3.457 ± 0.321
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.323 ± 0.241
TrpCys: 0.324 ± 0.107
TrpAsp: 1.404 ± 0.188
TrpGlu: 1.485 ± 0.203
TrpPhe: 0.864 ± 0.154
TrpGly: 1.431 ± 0.209
TrpHis: 0.567 ± 0.135
TrpIle: 0.864 ± 0.139
TrpLys: 1.08 ± 0.175
TrpLeu: 1.539 ± 0.238
TrpMet: 0.486 ± 0.12
TrpAsn: 0.999 ± 0.155
TrpPro: 0.486 ± 0.137
TrpGln: 0.459 ± 0.104
TrpArg: 0.81 ± 0.151
TrpSer: 1.107 ± 0.149
TrpThr: 1.377 ± 0.244
TrpVal: 1.215 ± 0.209
TrpTrp: 0.459 ± 0.131
TrpTyr: 0.891 ± 0.159
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.133 ± 0.308
TyrCys: 0.567 ± 0.132
TyrAsp: 2.727 ± 0.333
TyrGlu: 2.835 ± 0.33
TyrPhe: 1.053 ± 0.186
TyrGly: 3.619 ± 0.282
TyrHis: 0.702 ± 0.167
TyrIle: 2.133 ± 0.27
TyrLys: 2.349 ± 0.271
TyrLeu: 2.619 ± 0.29
TyrMet: 0.918 ± 0.165
TyrAsn: 2.187 ± 0.254
TyrPro: 1.512 ± 0.211
TyrGln: 1.485 ± 0.162
TyrArg: 2.241 ± 0.265
TyrSer: 2.592 ± 0.336
TyrThr: 2.673 ± 0.266
TyrVal: 2.781 ± 0.328
TyrTrp: 0.675 ± 0.129
TyrTyr: 2.025 ± 0.234
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 233 proteins (37032 amino acids)
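For readers who want to work with the entries programmatically, the following Python sketch parses one table entry and compares it with the uniform expectation of 2.5‰ mentioned in the introduction. The names `parse_cell` and `classify` are hypothetical, and the simple threshold rule is an assumption: the exact rule used for the red and blue marking on the page is not stated here. The pattern tolerates an optional leading copy of the value in front of the dipeptide label.

```python
import re

# Optional leading number, two three-letter residue codes, then "value ± error".
CELL = re.compile(
    r"^(?:\d+(?:\.\d+)?)?"            # optional duplicated display value
    r"([A-Z][a-z]{2})([A-Z][a-z]{2})"  # first and second residue
    r":\s*(\d+(?:\.\d+)?)\s*±\s*(\d+(?:\.\d+)?)$"
)


def parse_cell(line):
    """Return (first, second, value, error) or None for non-entry lines."""
    match = CELL.match(line.strip())
    if match is None:
        return None
    first, second, value, error = match.groups()
    return first, second, float(value), float(error)


def classify(value_permille, expected_permille=2.5):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    if value_permille > expected_permille:
        return "over"
    if value_permille < expected_permille:
        return "under"
    return "as expected"


entry = parse_cell("AlaAla: 7.831 ± 1.155")
label = classify(entry[2])
fraction = entry[2] * 1e-3  # per mille to fraction
```

Group heading lines such as `Ala` do not match the pattern and return `None`, so a whole table can be filtered line by line.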

Note: The error was estimated by bootstrapping (×100) at the protein level.
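A protein-level bootstrap of this kind can be sketched as follows. This is an illustration under stated assumptions, not the database's actual procedure: whole proteins are resampled with replacement, the frequency is recomputed for each of 100 replicates, and the sample standard deviation of the replicates is taken as the error. The function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    hits = total = 0
    for seq in proteins:
        for a, b in zip(seq, seq[1:]):
            total += 1
            hits += (a + b == pair)
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


toy_proteome = ["MAAGK", "AAL", "MKTAYIAKQR", "GAVLAA"]  # hypothetical
point = pair_permille(toy_proteome, "AA")
error = bootstrap_error(toy_proteome, "AA")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimated error reflects variation between proteins.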

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski