Amino acid dipepetide frequency for Pseudomonas phage pf16

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.157AlaAla: 7.157 ± 0.499
0.841AlaCys: 0.841 ± 0.14
5.209AlaAsp: 5.209 ± 0.237
5.291AlaGlu: 5.291 ± 0.402
3.425AlaPhe: 3.425 ± 0.259
4.962AlaGly: 4.962 ± 0.339
1.107AlaHis: 1.107 ± 0.143
4.716AlaIle: 4.716 ± 0.294
5.414AlaLys: 5.414 ± 0.375
7.013AlaLeu: 7.013 ± 0.414
2.092AlaMet: 2.092 ± 0.203
3.671AlaAsn: 3.671 ± 0.295
2.973AlaPro: 2.973 ± 0.339
3.158AlaGln: 3.158 ± 0.248
3.732AlaArg: 3.732 ± 0.233
4.06AlaSer: 4.06 ± 0.425
4.88AlaThr: 4.88 ± 0.339
6.829AlaVal: 6.829 ± 0.386
1.066AlaTrp: 1.066 ± 0.129
2.502AlaTyr: 2.502 ± 0.239
0.0AlaXaa: 0.0 ± 0.0
Cys
0.943CysAla: 0.943 ± 0.141
0.164CysCys: 0.164 ± 0.06
0.677CysAsp: 0.677 ± 0.12
0.574CysGlu: 0.574 ± 0.109
0.328CysPhe: 0.328 ± 0.086
0.943CysGly: 0.943 ± 0.162
0.267CysHis: 0.267 ± 0.07
0.513CysIle: 0.513 ± 0.109
0.697CysLys: 0.697 ± 0.117
0.595CysLeu: 0.595 ± 0.112
0.472CysMet: 0.472 ± 0.101
0.472CysAsn: 0.472 ± 0.111
0.513CysPro: 0.513 ± 0.1
0.472CysGln: 0.472 ± 0.118
0.636CysArg: 0.636 ± 0.103
0.841CysSer: 0.841 ± 0.14
0.779CysThr: 0.779 ± 0.127
0.779CysVal: 0.779 ± 0.133
0.123CysTrp: 0.123 ± 0.05
0.472CysTyr: 0.472 ± 0.087
0.0CysXaa: 0.0 ± 0.0
Asp
5.496AspAla: 5.496 ± 0.35
0.513AspCys: 0.513 ± 0.105
4.163AspAsp: 4.163 ± 0.295
4.511AspGlu: 4.511 ± 0.332
2.891AspPhe: 2.891 ± 0.218
4.47AspGly: 4.47 ± 0.251
1.128AspHis: 1.128 ± 0.147
3.978AspIle: 3.978 ± 0.275
4.204AspLys: 4.204 ± 0.268
5.168AspLeu: 5.168 ± 0.344
1.558AspMet: 1.558 ± 0.166
2.543AspAsn: 2.543 ± 0.213
2.379AspPro: 2.379 ± 0.271
2.051AspGln: 2.051 ± 0.204
2.44AspArg: 2.44 ± 0.21
3.753AspSer: 3.753 ± 0.276
3.425AspThr: 3.425 ± 0.25
4.778AspVal: 4.778 ± 0.323
0.8AspTrp: 0.8 ± 0.129
2.481AspTyr: 2.481 ± 0.241
0.0AspXaa: 0.0 ± 0.0
Glu
5.332GluAla: 5.332 ± 0.412
0.718GluCys: 0.718 ± 0.12
3.342GluAsp: 3.342 ± 0.27
4.265GluGlu: 4.265 ± 0.385
3.26GluPhe: 3.26 ± 0.295
3.794GluGly: 3.794 ± 0.316
1.64GluHis: 1.64 ± 0.204
3.425GluIle: 3.425 ± 0.281
3.301GluLys: 3.301 ± 0.276
6.89GluLeu: 6.89 ± 0.383
2.133GluMet: 2.133 ± 0.241
2.625GluAsn: 2.625 ± 0.279
2.194GluPro: 2.194 ± 0.276
2.481GluGln: 2.481 ± 0.263
3.794GluArg: 3.794 ± 0.303
3.835GluSer: 3.835 ± 0.266
3.527GluThr: 3.527 ± 0.285
5.127GluVal: 5.127 ± 0.296
1.046GluTrp: 1.046 ± 0.128
2.83GluTyr: 2.83 ± 0.287
0.0GluXaa: 0.0 ± 0.0
Phe
3.24PheAla: 3.24 ± 0.239
0.779PheCys: 0.779 ± 0.128
3.753PheAsp: 3.753 ± 0.305
3.014PheGlu: 3.014 ± 0.231
1.599PhePhe: 1.599 ± 0.2
2.256PheGly: 2.256 ± 0.214
1.025PheHis: 1.025 ± 0.171
2.625PheIle: 2.625 ± 0.265
3.671PheLys: 3.671 ± 0.253
2.707PheLeu: 2.707 ± 0.257
1.271PheMet: 1.271 ± 0.168
2.891PheAsn: 2.891 ± 0.235
1.128PhePro: 1.128 ± 0.143
0.984PheGln: 0.984 ± 0.131
1.723PheArg: 1.723 ± 0.199
2.686PheSer: 2.686 ± 0.199
2.727PheThr: 2.727 ± 0.219
3.486PheVal: 3.486 ± 0.309
0.574PheTrp: 0.574 ± 0.109
1.558PheTyr: 1.558 ± 0.164
0.0PheXaa: 0.0 ± 0.0
Gly
4.101GlyAla: 4.101 ± 0.357
0.738GlyCys: 0.738 ± 0.119
3.753GlyAsp: 3.753 ± 0.297
4.491GlyGlu: 4.491 ± 0.361
3.096GlyPhe: 3.096 ± 0.234
4.286GlyGly: 4.286 ± 0.378
1.169GlyHis: 1.169 ± 0.173
3.814GlyIle: 3.814 ± 0.325
4.347GlyLys: 4.347 ± 0.317
5.352GlyLeu: 5.352 ± 0.368
1.805GlyMet: 1.805 ± 0.182
3.342GlyAsn: 3.342 ± 0.329
1.456GlyPro: 1.456 ± 0.198
2.112GlyGln: 2.112 ± 0.234
2.809GlyArg: 2.809 ± 0.227
4.429GlySer: 4.429 ± 0.356
4.839GlyThr: 4.839 ± 0.498
5.516GlyVal: 5.516 ± 0.421
0.902GlyTrp: 0.902 ± 0.149
2.686GlyTyr: 2.686 ± 0.271
0.0GlyXaa: 0.0 ± 0.0
His
1.394HisAla: 1.394 ± 0.175
0.267HisCys: 0.267 ± 0.062
1.169HisAsp: 1.169 ± 0.179
1.148HisGlu: 1.148 ± 0.171
1.23HisPhe: 1.23 ± 0.153
1.271HisGly: 1.271 ± 0.189
0.492HisHis: 0.492 ± 0.102
1.169HisIle: 1.169 ± 0.168
1.292HisLys: 1.292 ± 0.181
1.784HisLeu: 1.784 ± 0.182
0.595HisMet: 0.595 ± 0.112
1.169HisAsn: 1.169 ± 0.165
0.984HisPro: 0.984 ± 0.156
0.636HisGln: 0.636 ± 0.12
0.697HisArg: 0.697 ± 0.113
1.271HisSer: 1.271 ± 0.211
0.718HisThr: 0.718 ± 0.125
1.599HisVal: 1.599 ± 0.241
0.349HisTrp: 0.349 ± 0.093
0.902HisTyr: 0.902 ± 0.144
0.0HisXaa: 0.0 ± 0.0
Ile
4.142IleAla: 4.142 ± 0.253
0.595IleCys: 0.595 ± 0.112
4.532IleAsp: 4.532 ± 0.302
4.327IleGlu: 4.327 ± 0.268
1.764IlePhe: 1.764 ± 0.187
3.65IleGly: 3.65 ± 0.301
1.312IleHis: 1.312 ± 0.212
3.527IleIle: 3.527 ± 0.332
4.634IleLys: 4.634 ± 0.311
4.163IleLeu: 4.163 ± 0.297
1.353IleMet: 1.353 ± 0.154
3.814IleAsn: 3.814 ± 0.255
2.174IlePro: 2.174 ± 0.246
1.907IleGln: 1.907 ± 0.171
2.85IleArg: 2.85 ± 0.302
3.486IleSer: 3.486 ± 0.244
3.835IleThr: 3.835 ± 0.298
4.532IleVal: 4.532 ± 0.297
0.533IleTrp: 0.533 ± 0.108
1.62IleTyr: 1.62 ± 0.173
0.0IleXaa: 0.0 ± 0.0
Lys
5.803LysAla: 5.803 ± 0.368
0.554LysCys: 0.554 ± 0.122
3.404LysAsp: 3.404 ± 0.27
4.245LysGlu: 4.245 ± 0.332
3.096LysPhe: 3.096 ± 0.246
3.999LysGly: 3.999 ± 0.26
2.01LysHis: 2.01 ± 0.208
3.425LysIle: 3.425 ± 0.289
3.712LysLys: 3.712 ± 0.377
5.742LysLeu: 5.742 ± 0.386
2.112LysMet: 2.112 ± 0.228
3.055LysAsn: 3.055 ± 0.235
2.912LysPro: 2.912 ± 0.239
2.666LysGln: 2.666 ± 0.269
3.486LysArg: 3.486 ± 0.287
3.568LysSer: 3.568 ± 0.263
3.63LysThr: 3.63 ± 0.282
5.147LysVal: 5.147 ± 0.336
0.718LysTrp: 0.718 ± 0.124
2.604LysTyr: 2.604 ± 0.212
0.0LysXaa: 0.0 ± 0.0
Leu
6.5LeuAla: 6.5 ± 0.364
1.066LeuCys: 1.066 ± 0.169
4.962LeuAsp: 4.962 ± 0.314
5.311LeuGlu: 5.311 ± 0.359
3.568LeuPhe: 3.568 ± 0.272
4.778LeuGly: 4.778 ± 0.301
1.517LeuHis: 1.517 ± 0.17
4.224LeuIle: 4.224 ± 0.251
6.582LeuLys: 6.582 ± 0.446
5.701LeuLeu: 5.701 ± 0.355
2.297LeuMet: 2.297 ± 0.275
4.409LeuAsn: 4.409 ± 0.339
3.178LeuPro: 3.178 ± 0.251
2.686LeuGln: 2.686 ± 0.194
3.712LeuArg: 3.712 ± 0.3
4.737LeuSer: 4.737 ± 0.28
4.327LeuThr: 4.327 ± 0.299
5.619LeuVal: 5.619 ± 0.316
0.882LeuTrp: 0.882 ± 0.142
3.035LeuTyr: 3.035 ± 0.316
0.0LeuXaa: 0.0 ± 0.0
Met
2.522MetAla: 2.522 ± 0.234
0.554MetCys: 0.554 ± 0.127
1.333MetAsp: 1.333 ± 0.18
1.723MetGlu: 1.723 ± 0.204
1.005MetPhe: 1.005 ± 0.168
1.271MetGly: 1.271 ± 0.171
0.41MetHis: 0.41 ± 0.093
1.415MetIle: 1.415 ± 0.172
2.112MetLys: 2.112 ± 0.235
2.174MetLeu: 2.174 ± 0.217
0.82MetMet: 0.82 ± 0.165
1.333MetAsn: 1.333 ± 0.17
1.046MetPro: 1.046 ± 0.162
1.025MetGln: 1.025 ± 0.124
1.476MetArg: 1.476 ± 0.172
2.358MetSer: 2.358 ± 0.201
1.846MetThr: 1.846 ± 0.179
2.071MetVal: 2.071 ± 0.191
0.205MetTrp: 0.205 ± 0.063
0.923MetTyr: 0.923 ± 0.153
0.0MetXaa: 0.0 ± 0.0
Asn
4.306AsnAla: 4.306 ± 0.344
0.636AsnCys: 0.636 ± 0.151
2.748AsnAsp: 2.748 ± 0.238
3.076AsnGlu: 3.076 ± 0.218
2.194AsnPhe: 2.194 ± 0.222
4.122AsnGly: 4.122 ± 0.311
0.902AsnHis: 0.902 ± 0.147
3.24AsnIle: 3.24 ± 0.274
3.322AsnLys: 3.322 ± 0.258
3.835AsnLeu: 3.835 ± 0.266
1.046AsnMet: 1.046 ± 0.144
2.276AsnAsn: 2.276 ± 0.308
2.44AsnPro: 2.44 ± 0.215
2.092AsnGln: 2.092 ± 0.178
2.461AsnArg: 2.461 ± 0.204
3.158AsnSer: 3.158 ± 0.25
2.666AsnThr: 2.666 ± 0.319
3.773AsnVal: 3.773 ± 0.278
0.636AsnTrp: 0.636 ± 0.108
1.579AsnTyr: 1.579 ± 0.187
0.0AsnXaa: 0.0 ± 0.0
Pro
3.466ProAla: 3.466 ± 0.311
0.369ProCys: 0.369 ± 0.098
2.338ProAsp: 2.338 ± 0.24
3.404ProGlu: 3.404 ± 0.326
1.64ProPhe: 1.64 ± 0.17
2.174ProGly: 2.174 ± 0.218
0.8ProHis: 0.8 ± 0.125
1.969ProIle: 1.969 ± 0.191
1.948ProLys: 1.948 ± 0.249
2.563ProLeu: 2.563 ± 0.213
0.964ProMet: 0.964 ± 0.149
1.456ProAsn: 1.456 ± 0.148
0.984ProPro: 0.984 ± 0.16
1.292ProGln: 1.292 ± 0.172
1.64ProArg: 1.64 ± 0.19
2.666ProSer: 2.666 ± 0.24
2.461ProThr: 2.461 ± 0.223
3.158ProVal: 3.158 ± 0.287
0.39ProTrp: 0.39 ± 0.083
1.661ProTyr: 1.661 ± 0.184
0.0ProXaa: 0.0 ± 0.0
Gln
3.055GlnAla: 3.055 ± 0.318
0.431GlnCys: 0.431 ± 0.087
1.907GlnAsp: 1.907 ± 0.225
2.092GlnGlu: 2.092 ± 0.185
1.928GlnPhe: 1.928 ± 0.19
2.194GlnGly: 2.194 ± 0.235
0.738GlnHis: 0.738 ± 0.117
2.071GlnIle: 2.071 ± 0.19
1.887GlnLys: 1.887 ± 0.21
3.301GlnLeu: 3.301 ± 0.288
1.087GlnMet: 1.087 ± 0.136
1.681GlnAsn: 1.681 ± 0.197
1.476GlnPro: 1.476 ± 0.189
1.743GlnGln: 1.743 ± 0.173
1.866GlnArg: 1.866 ± 0.194
2.133GlnSer: 2.133 ± 0.211
2.071GlnThr: 2.071 ± 0.219
2.358GlnVal: 2.358 ± 0.214
0.513GlnTrp: 0.513 ± 0.099
1.784GlnTyr: 1.784 ± 0.209
0.0GlnXaa: 0.0 ± 0.0
Arg
3.404ArgAla: 3.404 ± 0.263
0.533ArgCys: 0.533 ± 0.098
2.953ArgAsp: 2.953 ± 0.252
3.26ArgGlu: 3.26 ± 0.333
1.907ArgPhe: 1.907 ± 0.191
3.568ArgGly: 3.568 ± 0.308
0.984ArgHis: 0.984 ± 0.143
3.445ArgIle: 3.445 ± 0.268
3.055ArgLys: 3.055 ± 0.291
3.507ArgLeu: 3.507 ± 0.282
1.62ArgMet: 1.62 ± 0.195
2.461ArgAsn: 2.461 ± 0.225
1.23ArgPro: 1.23 ± 0.146
1.702ArgGln: 1.702 ± 0.174
2.871ArgArg: 2.871 ± 0.243
2.727ArgSer: 2.727 ± 0.241
2.748ArgThr: 2.748 ± 0.226
3.466ArgVal: 3.466 ± 0.29
0.513ArgTrp: 0.513 ± 0.088
2.071ArgTyr: 2.071 ± 0.236
0.0ArgXaa: 0.0 ± 0.0
Ser
4.655SerAla: 4.655 ± 0.358
0.677SerCys: 0.677 ± 0.136
3.342SerAsp: 3.342 ± 0.226
3.199SerGlu: 3.199 ± 0.297
3.158SerPhe: 3.158 ± 0.256
4.491SerGly: 4.491 ± 0.438
1.128SerHis: 1.128 ± 0.163
3.999SerIle: 3.999 ± 0.291
3.978SerLys: 3.978 ± 0.364
4.86SerLeu: 4.86 ± 0.26
1.456SerMet: 1.456 ± 0.186
3.158SerAsn: 3.158 ± 0.257
2.276SerPro: 2.276 ± 0.213
2.235SerGln: 2.235 ± 0.219
2.502SerArg: 2.502 ± 0.193
3.712SerSer: 3.712 ± 0.292
4.081SerThr: 4.081 ± 0.323
4.368SerVal: 4.368 ± 0.341
0.697SerTrp: 0.697 ± 0.114
2.235SerTyr: 2.235 ± 0.248
0.0SerXaa: 0.0 ± 0.0
Thr
4.675ThrAla: 4.675 ± 0.374
0.533ThrCys: 0.533 ± 0.103
3.589ThrAsp: 3.589 ± 0.259
3.322ThrGlu: 3.322 ± 0.273
2.727ThrPhe: 2.727 ± 0.231
4.511ThrGly: 4.511 ± 0.429
1.066ThrHis: 1.066 ± 0.166
3.917ThrIle: 3.917 ± 0.292
3.301ThrLys: 3.301 ± 0.22
4.716ThrLeu: 4.716 ± 0.323
1.333ThrMet: 1.333 ± 0.201
2.85ThrAsn: 2.85 ± 0.271
2.891ThrPro: 2.891 ± 0.284
2.83ThrGln: 2.83 ± 0.251
2.789ThrArg: 2.789 ± 0.222
3.404ThrSer: 3.404 ± 0.337
4.204ThrThr: 4.204 ± 0.475
5.147ThrVal: 5.147 ± 0.388
0.554ThrTrp: 0.554 ± 0.118
2.317ThrTyr: 2.317 ± 0.288
0.0ThrXaa: 0.0 ± 0.0
Val
6.07ValAla: 6.07 ± 0.338
0.697ValCys: 0.697 ± 0.108
5.906ValAsp: 5.906 ± 0.309
5.414ValGlu: 5.414 ± 0.339
2.891ValPhe: 2.891 ± 0.248
5.065ValGly: 5.065 ± 0.353
1.394ValHis: 1.394 ± 0.2
4.245ValIle: 4.245 ± 0.288
5.393ValLys: 5.393 ± 0.342
5.414ValLeu: 5.414 ± 0.296
2.153ValMet: 2.153 ± 0.199
4.532ValAsn: 4.532 ± 0.399
3.24ValPro: 3.24 ± 0.234
2.399ValGln: 2.399 ± 0.186
3.958ValArg: 3.958 ± 0.272
4.634ValSer: 4.634 ± 0.381
4.614ValThr: 4.614 ± 0.414
5.66ValVal: 5.66 ± 0.426
0.615ValTrp: 0.615 ± 0.109
2.789ValTyr: 2.789 ± 0.202
0.0ValXaa: 0.0 ± 0.0
Trp
0.841TrpAla: 0.841 ± 0.125
0.062TrpCys: 0.062 ± 0.034
0.779TrpAsp: 0.779 ± 0.123
0.697TrpGlu: 0.697 ± 0.107
0.595TrpPhe: 0.595 ± 0.102
0.779TrpGly: 0.779 ± 0.124
0.267TrpHis: 0.267 ± 0.068
0.697TrpIle: 0.697 ± 0.114
0.636TrpLys: 0.636 ± 0.103
1.046TrpLeu: 1.046 ± 0.14
0.369TrpMet: 0.369 ± 0.083
0.677TrpAsn: 0.677 ± 0.11
0.451TrpPro: 0.451 ± 0.083
0.41TrpGln: 0.41 ± 0.094
0.492TrpArg: 0.492 ± 0.093
0.841TrpSer: 0.841 ± 0.129
0.677TrpThr: 0.677 ± 0.124
0.923TrpVal: 0.923 ± 0.162
0.144TrpTrp: 0.144 ± 0.054
0.472TrpTyr: 0.472 ± 0.091
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.994TyrAla: 2.994 ± 0.29
0.554TyrCys: 0.554 ± 0.113
2.871TyrAsp: 2.871 ± 0.227
2.194TyrGlu: 2.194 ± 0.198
1.394TyrPhe: 1.394 ± 0.148
2.461TyrGly: 2.461 ± 0.208
0.779TyrHis: 0.779 ± 0.136
2.338TyrIle: 2.338 ± 0.211
2.399TyrLys: 2.399 ± 0.222
2.563TyrLeu: 2.563 ± 0.246
1.066TyrMet: 1.066 ± 0.17
2.092TyrAsn: 2.092 ± 0.198
1.415TyrPro: 1.415 ± 0.217
1.435TyrGln: 1.435 ± 0.161
2.092TyrArg: 2.092 ± 0.205
1.907TyrSer: 1.907 ± 0.193
2.625TyrThr: 2.625 ± 0.224
2.748TyrVal: 2.748 ± 0.249
0.595TyrTrp: 0.595 ± 0.117
1.599TyrTyr: 1.599 ± 0.187
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 237 proteins (48767 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski