Amino acid dipepetide frequency for Morganella phage vB_MmoM_MP1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.102AlaAla: 2.102 ± 0.208
0.424AlaCys: 0.424 ± 0.084
3.278AlaAsp: 3.278 ± 0.249
4.319AlaGlu: 4.319 ± 0.375
2.102AlaPhe: 2.102 ± 0.211
3.452AlaGly: 3.452 ± 0.333
0.713AlaHis: 0.713 ± 0.107
4.146AlaIle: 4.146 ± 0.337
4.049AlaLys: 4.049 ± 0.274
4.416AlaLeu: 4.416 ± 0.269
1.234AlaMet: 1.234 ± 0.173
2.333AlaAsn: 2.333 ± 0.207
1.639AlaPro: 1.639 ± 0.201
2.063AlaGln: 2.063 ± 0.197
2.082AlaArg: 2.082 ± 0.218
3.799AlaSer: 3.799 ± 0.307
2.738AlaThr: 2.738 ± 0.289
2.622AlaVal: 2.622 ± 0.28
0.733AlaTrp: 0.733 ± 0.123
2.198AlaTyr: 2.198 ± 0.227
0.0AlaXaa: 0.0 ± 0.0
Cys
0.81CysAla: 0.81 ± 0.136
0.212CysCys: 0.212 ± 0.066
0.81CysAsp: 0.81 ± 0.111
1.138CysGlu: 1.138 ± 0.147
0.521CysPhe: 0.521 ± 0.109
0.656CysGly: 0.656 ± 0.14
0.309CysHis: 0.309 ± 0.074
0.848CysIle: 0.848 ± 0.128
0.829CysLys: 0.829 ± 0.15
0.945CysLeu: 0.945 ± 0.164
0.251CysMet: 0.251 ± 0.067
0.578CysAsn: 0.578 ± 0.115
0.578CysPro: 0.578 ± 0.106
0.27CysGln: 0.27 ± 0.072
0.656CysArg: 0.656 ± 0.101
0.964CysSer: 0.964 ± 0.148
0.733CysThr: 0.733 ± 0.116
0.752CysVal: 0.752 ± 0.124
0.116CysTrp: 0.116 ± 0.043
0.636CysTyr: 0.636 ± 0.106
0.0CysXaa: 0.0 ± 0.0
Asp
2.989AspAla: 2.989 ± 0.292
0.636AspCys: 0.636 ± 0.102
4.165AspAsp: 4.165 ± 0.282
5.303AspGlu: 5.303 ± 0.306
3.394AspPhe: 3.394 ± 0.259
4.358AspGly: 4.358 ± 0.3
1.118AspHis: 1.118 ± 0.139
5.553AspIle: 5.553 ± 0.348
5.746AspLys: 5.746 ± 0.39
5.457AspLeu: 5.457 ± 0.336
1.967AspMet: 1.967 ± 0.176
3.664AspAsn: 3.664 ± 0.271
1.813AspPro: 1.813 ± 0.197
2.063AspGln: 2.063 ± 0.245
1.755AspArg: 1.755 ± 0.189
4.069AspSer: 4.069 ± 0.269
3.895AspThr: 3.895 ± 0.252
4.763AspVal: 4.763 ± 0.299
1.35AspTrp: 1.35 ± 0.173
3.644AspTyr: 3.644 ± 0.262
0.0AspXaa: 0.0 ± 0.0
Glu
4.184GluAla: 4.184 ± 0.281
1.253GluCys: 1.253 ± 0.168
4.396GluAsp: 4.396 ± 0.301
5.573GluGlu: 5.573 ± 0.413
3.587GluPhe: 3.587 ± 0.292
3.818GluGly: 3.818 ± 0.279
1.678GluHis: 1.678 ± 0.184
6.286GluIle: 6.286 ± 0.343
5.322GluLys: 5.322 ± 0.379
6.942GluLeu: 6.942 ± 0.345
2.41GluMet: 2.41 ± 0.185
3.895GluAsn: 3.895 ± 0.278
2.005GluPro: 2.005 ± 0.2
2.526GluGln: 2.526 ± 0.235
2.584GluArg: 2.584 ± 0.227
3.972GluSer: 3.972 ± 0.282
4.724GluThr: 4.724 ± 0.356
5.013GluVal: 5.013 ± 0.348
0.926GluTrp: 0.926 ± 0.155
4.435GluTyr: 4.435 ± 0.291
0.0GluXaa: 0.0 ± 0.0
Phe
2.102PheAla: 2.102 ± 0.203
0.598PheCys: 0.598 ± 0.092
3.953PheAsp: 3.953 ± 0.295
4.204PheGlu: 4.204 ± 0.31
1.986PhePhe: 1.986 ± 0.225
2.7PheGly: 2.7 ± 0.247
0.501PheHis: 0.501 ± 0.096
3.509PheIle: 3.509 ± 0.246
4.281PheLys: 4.281 ± 0.274
2.487PheLeu: 2.487 ± 0.229
1.196PheMet: 1.196 ± 0.131
3.182PheAsn: 3.182 ± 0.236
1.292PhePro: 1.292 ± 0.153
1.099PheGln: 1.099 ± 0.15
1.851PheArg: 1.851 ± 0.188
3.047PheSer: 3.047 ± 0.215
2.295PheThr: 2.295 ± 0.167
3.143PheVal: 3.143 ± 0.226
0.617PheTrp: 0.617 ± 0.117
2.295PheTyr: 2.295 ± 0.261
0.0PheXaa: 0.0 ± 0.0
Gly
2.372GlyAla: 2.372 ± 0.25
0.848GlyCys: 0.848 ± 0.133
4.223GlyAsp: 4.223 ± 0.323
4.204GlyGlu: 4.204 ± 0.26
2.815GlyPhe: 2.815 ± 0.242
2.757GlyGly: 2.757 ± 0.244
1.08GlyHis: 1.08 ± 0.141
4.146GlyIle: 4.146 ± 0.255
4.589GlyLys: 4.589 ± 0.366
5.438GlyLeu: 5.438 ± 0.359
1.581GlyMet: 1.581 ± 0.199
2.854GlyAsn: 2.854 ± 0.239
1.35GlyPro: 1.35 ± 0.138
1.986GlyGln: 1.986 ± 0.185
2.603GlyArg: 2.603 ± 0.233
4.011GlySer: 4.011 ± 0.288
3.452GlyThr: 3.452 ± 0.344
3.548GlyVal: 3.548 ± 0.27
0.81GlyTrp: 0.81 ± 0.13
2.68GlyTyr: 2.68 ± 0.206
0.0GlyXaa: 0.0 ± 0.0
His
0.945HisAla: 0.945 ± 0.147
0.27HisCys: 0.27 ± 0.081
1.157HisAsp: 1.157 ± 0.139
1.35HisGlu: 1.35 ± 0.151
1.157HisPhe: 1.157 ± 0.137
0.964HisGly: 0.964 ± 0.161
0.328HisHis: 0.328 ± 0.082
1.408HisIle: 1.408 ± 0.166
1.658HisLys: 1.658 ± 0.187
1.465HisLeu: 1.465 ± 0.155
0.405HisMet: 0.405 ± 0.081
1.253HisAsn: 1.253 ± 0.156
0.964HisPro: 0.964 ± 0.148
0.386HisGln: 0.386 ± 0.082
0.598HisArg: 0.598 ± 0.111
0.926HisSer: 0.926 ± 0.14
0.733HisThr: 0.733 ± 0.113
1.118HisVal: 1.118 ± 0.139
0.251HisTrp: 0.251 ± 0.079
0.81HisTyr: 0.81 ± 0.14
0.0HisXaa: 0.0 ± 0.0
Ile
4.184IleAla: 4.184 ± 0.332
0.964IleCys: 0.964 ± 0.127
5.92IleAsp: 5.92 ± 0.338
6.575IleGlu: 6.575 ± 0.379
3.182IlePhe: 3.182 ± 0.273
3.818IleGly: 3.818 ± 0.282
1.273IleHis: 1.273 ± 0.152
5.804IleIle: 5.804 ± 0.366
7.964IleLys: 7.964 ± 0.397
4.801IleLeu: 4.801 ± 0.335
2.179IleMet: 2.179 ± 0.211
5.476IleAsn: 5.476 ± 0.406
2.584IlePro: 2.584 ± 0.24
2.352IleGln: 2.352 ± 0.202
3.027IleArg: 3.027 ± 0.253
5.071IleSer: 5.071 ± 0.297
4.878IleThr: 4.878 ± 0.316
4.531IleVal: 4.531 ± 0.276
0.598IleTrp: 0.598 ± 0.102
3.066IleTyr: 3.066 ± 0.247
0.0IleXaa: 0.0 ± 0.0
Lys
4.358LysAla: 4.358 ± 0.319
0.887LysCys: 0.887 ± 0.144
6.112LysAsp: 6.112 ± 0.37
6.247LysGlu: 6.247 ± 0.394
4.03LysPhe: 4.03 ± 0.324
4.146LysGly: 4.146 ± 0.255
2.025LysHis: 2.025 ± 0.19
6.787LysIle: 6.787 ± 0.358
5.418LysLys: 5.418 ± 0.381
7.057LysLeu: 7.057 ± 0.446
2.487LysMet: 2.487 ± 0.21
4.801LysAsn: 4.801 ± 0.34
2.777LysPro: 2.777 ± 0.204
2.256LysGln: 2.256 ± 0.22
3.104LysArg: 3.104 ± 0.229
5.148LysSer: 5.148 ± 0.353
5.804LysThr: 5.804 ± 0.343
4.859LysVal: 4.859 ± 0.34
1.176LysTrp: 1.176 ± 0.145
4.57LysTyr: 4.57 ± 0.395
0.0LysXaa: 0.0 ± 0.0
Leu
4.339LeuAla: 4.339 ± 0.332
1.196LeuCys: 1.196 ± 0.162
4.956LeuAsp: 4.956 ± 0.333
5.553LeuGlu: 5.553 ± 0.336
2.873LeuPhe: 2.873 ± 0.233
4.358LeuGly: 4.358 ± 0.332
1.388LeuHis: 1.388 ± 0.181
5.457LeuIle: 5.457 ± 0.292
7.597LeuLys: 7.597 ± 0.385
5.091LeuLeu: 5.091 ± 0.301
2.063LeuMet: 2.063 ± 0.202
4.242LeuAsn: 4.242 ± 0.244
2.834LeuPro: 2.834 ± 0.223
2.449LeuGln: 2.449 ± 0.268
2.854LeuArg: 2.854 ± 0.236
5.573LeuSer: 5.573 ± 0.333
4.917LeuThr: 4.917 ± 0.288
5.129LeuVal: 5.129 ± 0.315
0.713LeuTrp: 0.713 ± 0.121
3.124LeuTyr: 3.124 ± 0.291
0.0LeuXaa: 0.0 ± 0.0
Met
1.35MetAla: 1.35 ± 0.157
0.212MetCys: 0.212 ± 0.063
1.292MetAsp: 1.292 ± 0.164
1.909MetGlu: 1.909 ± 0.202
1.6MetPhe: 1.6 ± 0.178
1.311MetGly: 1.311 ± 0.167
0.521MetHis: 0.521 ± 0.105
1.948MetIle: 1.948 ± 0.202
2.854MetLys: 2.854 ± 0.212
1.774MetLeu: 1.774 ± 0.176
0.81MetMet: 0.81 ± 0.118
1.774MetAsn: 1.774 ± 0.175
0.656MetPro: 0.656 ± 0.111
1.003MetGln: 1.003 ± 0.145
0.598MetArg: 0.598 ± 0.107
1.6MetSer: 1.6 ± 0.145
1.639MetThr: 1.639 ± 0.177
1.408MetVal: 1.408 ± 0.182
0.366MetTrp: 0.366 ± 0.083
1.311MetTyr: 1.311 ± 0.153
0.0MetXaa: 0.0 ± 0.0
Asn
2.873AsnAla: 2.873 ± 0.205
0.617AsnCys: 0.617 ± 0.09
3.297AsnAsp: 3.297 ± 0.237
3.972AsnGlu: 3.972 ± 0.25
2.352AsnPhe: 2.352 ± 0.202
4.589AsnGly: 4.589 ± 0.278
0.964AsnHis: 0.964 ± 0.13
4.416AsnIle: 4.416 ± 0.294
4.84AsnLys: 4.84 ± 0.325
4.396AsnLeu: 4.396 ± 0.327
1.697AsnMet: 1.697 ± 0.184
3.664AsnAsn: 3.664 ± 0.287
2.719AsnPro: 2.719 ± 0.222
1.697AsnGln: 1.697 ± 0.183
2.526AsnArg: 2.526 ± 0.196
3.336AsnSer: 3.336 ± 0.234
3.548AsnThr: 3.548 ± 0.316
2.989AsnVal: 2.989 ± 0.223
0.636AsnTrp: 0.636 ± 0.117
2.352AsnTyr: 2.352 ± 0.236
0.0AsnXaa: 0.0 ± 0.0
Pro
1.716ProAla: 1.716 ± 0.192
0.443ProCys: 0.443 ± 0.095
2.43ProAsp: 2.43 ± 0.22
3.162ProGlu: 3.162 ± 0.263
1.62ProPhe: 1.62 ± 0.202
2.16ProGly: 2.16 ± 0.19
0.636ProHis: 0.636 ± 0.127
2.16ProIle: 2.16 ± 0.207
2.237ProLys: 2.237 ± 0.228
1.967ProLeu: 1.967 ± 0.19
0.598ProMet: 0.598 ± 0.116
1.832ProAsn: 1.832 ± 0.194
0.752ProPro: 0.752 ± 0.141
0.713ProGln: 0.713 ± 0.122
1.273ProArg: 1.273 ± 0.153
1.986ProSer: 1.986 ± 0.212
2.565ProThr: 2.565 ± 0.231
2.777ProVal: 2.777 ± 0.228
0.521ProTrp: 0.521 ± 0.093
1.6ProTyr: 1.6 ± 0.178
0.0ProXaa: 0.0 ± 0.0
Gln
1.928GlnAla: 1.928 ± 0.205
0.27GlnCys: 0.27 ± 0.07
1.851GlnAsp: 1.851 ± 0.205
2.352GlnGlu: 2.352 ± 0.251
1.543GlnPhe: 1.543 ± 0.164
1.639GlnGly: 1.639 ± 0.163
0.54GlnHis: 0.54 ± 0.122
2.642GlnIle: 2.642 ± 0.229
2.719GlnLys: 2.719 ± 0.222
2.757GlnLeu: 2.757 ± 0.225
0.868GlnMet: 0.868 ± 0.126
1.292GlnAsn: 1.292 ± 0.167
0.868GlnPro: 0.868 ± 0.127
0.887GlnGln: 0.887 ± 0.12
1.253GlnArg: 1.253 ± 0.163
1.774GlnSer: 1.774 ± 0.211
1.89GlnThr: 1.89 ± 0.211
1.851GlnVal: 1.851 ± 0.203
0.386GlnTrp: 0.386 ± 0.084
1.678GlnTyr: 1.678 ± 0.176
0.0GlnXaa: 0.0 ± 0.0
Arg
2.275ArgAla: 2.275 ± 0.213
0.617ArgCys: 0.617 ± 0.107
2.333ArgAsp: 2.333 ± 0.218
2.931ArgGlu: 2.931 ± 0.231
1.6ArgPhe: 1.6 ± 0.183
2.275ArgGly: 2.275 ± 0.232
0.791ArgHis: 0.791 ± 0.133
3.529ArgIle: 3.529 ± 0.233
2.873ArgLys: 2.873 ± 0.263
3.336ArgLeu: 3.336 ± 0.24
0.945ArgMet: 0.945 ± 0.149
2.082ArgAsn: 2.082 ± 0.216
0.887ArgPro: 0.887 ± 0.107
1.253ArgGln: 1.253 ± 0.171
1.62ArgArg: 1.62 ± 0.198
2.082ArgSer: 2.082 ± 0.182
2.025ArgThr: 2.025 ± 0.178
2.449ArgVal: 2.449 ± 0.203
0.482ArgTrp: 0.482 ± 0.113
2.121ArgTyr: 2.121 ± 0.211
0.0ArgXaa: 0.0 ± 0.0
Ser
3.124SerAla: 3.124 ± 0.274
0.868SerCys: 0.868 ± 0.131
4.743SerAsp: 4.743 ± 0.325
3.741SerGlu: 3.741 ± 0.228
3.182SerPhe: 3.182 ± 0.245
3.953SerGly: 3.953 ± 0.289
0.926SerHis: 0.926 ± 0.133
4.84SerIle: 4.84 ± 0.279
4.956SerLys: 4.956 ± 0.285
4.917SerLeu: 4.917 ± 0.287
1.041SerMet: 1.041 ± 0.15
3.297SerAsn: 3.297 ± 0.254
2.005SerPro: 2.005 ± 0.213
1.755SerGln: 1.755 ± 0.145
2.661SerArg: 2.661 ± 0.236
3.567SerSer: 3.567 ± 0.264
4.146SerThr: 4.146 ± 0.356
4.377SerVal: 4.377 ± 0.285
0.791SerTrp: 0.791 ± 0.107
2.815SerTyr: 2.815 ± 0.242
0.0SerXaa: 0.0 ± 0.0
Thr
2.912ThrAla: 2.912 ± 0.251
0.443ThrCys: 0.443 ± 0.094
3.741ThrAsp: 3.741 ± 0.303
4.647ThrGlu: 4.647 ± 0.285
2.777ThrPhe: 2.777 ± 0.243
3.972ThrGly: 3.972 ± 0.286
0.926ThrHis: 0.926 ± 0.138
4.878ThrIle: 4.878 ± 0.402
4.705ThrLys: 4.705 ± 0.317
4.242ThrLeu: 4.242 ± 0.278
1.061ThrMet: 1.061 ± 0.139
3.49ThrAsn: 3.49 ± 0.266
3.162ThrPro: 3.162 ± 0.277
2.063ThrGln: 2.063 ± 0.198
2.565ThrArg: 2.565 ± 0.207
3.124ThrSer: 3.124 ± 0.315
3.297ThrThr: 3.297 ± 0.27
4.512ThrVal: 4.512 ± 0.333
0.848ThrTrp: 0.848 ± 0.12
2.584ThrTyr: 2.584 ± 0.255
0.0ThrXaa: 0.0 ± 0.0
Val
2.584ValAla: 2.584 ± 0.235
0.868ValCys: 0.868 ± 0.135
4.608ValAsp: 4.608 ± 0.336
4.782ValGlu: 4.782 ± 0.36
2.912ValPhe: 2.912 ± 0.216
3.124ValGly: 3.124 ± 0.255
1.234ValHis: 1.234 ± 0.166
4.936ValIle: 4.936 ± 0.277
6.093ValLys: 6.093 ± 0.336
5.245ValLeu: 5.245 ± 0.279
1.6ValMet: 1.6 ± 0.165
4.011ValAsn: 4.011 ± 0.271
2.063ValPro: 2.063 ± 0.208
2.275ValGln: 2.275 ± 0.169
2.526ValArg: 2.526 ± 0.232
4.126ValSer: 4.126 ± 0.318
3.085ValThr: 3.085 ± 0.257
4.377ValVal: 4.377 ± 0.311
0.829ValTrp: 0.829 ± 0.108
2.989ValTyr: 2.989 ± 0.287
0.0ValXaa: 0.0 ± 0.0
Trp
0.713TrpAla: 0.713 ± 0.117
0.231TrpCys: 0.231 ± 0.071
1.08TrpAsp: 1.08 ± 0.131
0.694TrpGlu: 0.694 ± 0.117
0.617TrpPhe: 0.617 ± 0.104
0.424TrpGly: 0.424 ± 0.084
0.289TrpHis: 0.289 ± 0.083
0.926TrpIle: 0.926 ± 0.134
1.138TrpLys: 1.138 ± 0.173
1.041TrpLeu: 1.041 ± 0.142
0.405TrpMet: 0.405 ± 0.082
0.656TrpAsn: 0.656 ± 0.101
0.405TrpPro: 0.405 ± 0.084
0.309TrpGln: 0.309 ± 0.071
0.366TrpArg: 0.366 ± 0.083
0.868TrpSer: 0.868 ± 0.126
0.945TrpThr: 0.945 ± 0.161
0.868TrpVal: 0.868 ± 0.128
0.193TrpTrp: 0.193 ± 0.065
0.752TrpTyr: 0.752 ± 0.124
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.584TyrAla: 2.584 ± 0.244
0.752TyrCys: 0.752 ± 0.119
3.355TyrAsp: 3.355 ± 0.23
2.873TyrGlu: 2.873 ± 0.225
2.487TyrPhe: 2.487 ± 0.24
2.873TyrGly: 2.873 ± 0.216
0.906TyrHis: 0.906 ± 0.133
4.03TyrIle: 4.03 ± 0.287
4.204TyrLys: 4.204 ± 0.321
2.931TyrLeu: 2.931 ± 0.215
1.118TyrMet: 1.118 ± 0.141
3.104TyrAsn: 3.104 ± 0.253
1.793TyrPro: 1.793 ± 0.211
1.581TyrGln: 1.581 ± 0.192
2.005TyrArg: 2.005 ± 0.192
2.661TyrSer: 2.661 ± 0.228
2.642TyrThr: 2.642 ± 0.232
3.182TyrVal: 3.182 ± 0.273
0.578TyrTrp: 0.578 ± 0.107
2.333TyrTyr: 2.333 ± 0.249
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 271 proteins (51862 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski