Amino acid dipeptide frequency for Citrobacter phage Moon

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
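As a minimal sketch of this calculation (not the Proteome-pI implementation), the Python function below counts overlapping residue pairs inside each protein and reports them in per mille. The function name `dipeptide_frequencies`, the one-letter input sequences, and the choice to normalize by the total number of pairs counted are assumptions made for illustration; the database may normalize differently.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: pairs are counted with a sliding window of width 2 inside
    each protein (never across protein boundaries) and normalized by the
    total number of pairs counted.
    """
    counts = Counter()
    for seq in proteins:
        seq = seq.upper()
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input, not the phage proteome.
freqs = dipeptide_frequencies(["MKKLA", "AKK"])
```

In the toy input, the pair formed by the last residue of the first sequence and the first residue of the second is deliberately not counted, because it does not exist in either protein.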

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly with equal probability, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
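The arithmetic behind the expected value can be sketched as follows. The helper names `expected_uniform_permille` and `classify` are hypothetical, and the simple greater-than/less-than rule is an assumption: the page does not state whether the colouring uses the 400 or the 441 baseline, or whether it takes the error estimates into account.

```python
STANDARD_AMINO_ACIDS = 20

def expected_uniform_permille(alphabet_size=STANDARD_AMINO_ACIDS):
    """Per mille frequency of each dipeptide if all were equally likely."""
    return 1000.0 / alphabet_size ** 2

def classify(observed_permille, expected_permille):
    """Label a dipeptide relative to the uniform expectation."""
    if observed_permille > expected_permille:
        return "over-represented"
    if observed_permille < expected_permille:
        return "under-represented"
    return "as expected"

expected = expected_uniform_permille()             # 2.5 per mille, i.e. 0.25%
expected_with_xaa = expected_uniform_permille(21)  # 1000 / 441, about 2.27 per mille

# Two values taken from the table: GluLeu and CysCys.
glu_leu_label = classify(7.342, expected)
cys_cys_label = classify(0.111, expected)
```

Under this rule GluLeu (7.342‰) falls above the 2.5‰ baseline and CysCys (0.111‰) falls below it.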

All values are given in per mille (‰) and must therefore be multiplied by 10^-3 to obtain fractions.
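For readers who prefer fractions or percentages, the unit conversion is a one-liner. The function names below are hypothetical, and the AlaAla value is copied from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0

ala_ala = 5.123  # AlaAla frequency in per mille, from the table
fraction = permille_to_fraction(ala_ala)  # roughly 0.005123
percent = permille_to_percent(ala_ala)    # roughly 0.5123
```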

For more information, see the sequence space article on Wikipedia.

Rows: first residue of the dipeptide; columns: second residue. Cells: frequency ± error (‰).

First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 5.123 ± 0.339 | 0.462 ± 0.098 | 4.032 ± 0.275 | 5.012 ± 0.366 | 2.33 ± 0.185 | 3.958 ± 0.295 | 1.147 ± 0.153 | 5.215 ± 0.321 | 5.178 ± 0.261 | 5.992 ± 0.418 | 1.831 ± 0.168 | 3.125 ± 0.219 | 2.275 ± 0.218 | 2.589 ± 0.241 | 2.996 ± 0.211 | 3.828 ± 0.329 | 3.791 ± 0.374 | 4.476 ± 0.333 | 1.147 ± 0.129 | 2.589 ± 0.267 | 0.0 ± 0.0
Cys | 0.518 ± 0.108 | 0.111 ± 0.058 | 0.814 ± 0.123 | 0.795 ± 0.141 | 0.518 ± 0.112 | 0.795 ± 0.137 | 0.37 ± 0.088 | 0.61 ± 0.109 | 0.758 ± 0.143 | 0.647 ± 0.119 | 0.37 ± 0.089 | 0.462 ± 0.102 | 0.499 ± 0.109 | 0.425 ± 0.088 | 0.555 ± 0.11 | 0.869 ± 0.129 | 0.592 ± 0.095 | 0.592 ± 0.111 | 0.203 ± 0.053 | 0.462 ± 0.091 | 0.0 ± 0.0
Asp | 4.272 ± 0.306 | 0.573 ± 0.111 | 4.291 ± 0.27 | 4.549 ± 0.349 | 3.181 ± 0.235 | 4.272 ± 0.272 | 1.073 ± 0.147 | 5.141 ± 0.338 | 4.494 ± 0.285 | 5.197 ± 0.323 | 1.794 ± 0.18 | 3.495 ± 0.189 | 2.238 ± 0.204 | 1.942 ± 0.201 | 2.645 ± 0.227 | 3.921 ± 0.255 | 3.255 ± 0.241 | 4.032 ± 0.242 | 1.387 ± 0.182 | 3.458 ± 0.252 | 0.0 ± 0.0
Glu | 5.215 ± 0.355 | 0.98 ± 0.137 | 4.383 ± 0.283 | 5.197 ± 0.382 | 3.551 ± 0.269 | 3.958 ± 0.285 | 1.387 ± 0.145 | 5.604 ± 0.378 | 5.234 ± 0.306 | 7.342 ± 0.436 | 2.645 ± 0.238 | 3.421 ± 0.268 | 1.553 ± 0.167 | 2.219 ± 0.221 | 2.756 ± 0.251 | 4.346 ± 0.318 | 3.791 ± 0.283 | 5.178 ± 0.308 | 0.906 ± 0.119 | 3.162 ± 0.25 | 0.0 ± 0.0
Phe | 2.811 ± 0.243 | 0.592 ± 0.09 | 3.495 ± 0.221 | 3.754 ± 0.263 | 1.572 ± 0.173 | 3.125 ± 0.218 | 0.795 ± 0.116 | 3.236 ± 0.212 | 3.68 ± 0.245 | 2.312 ± 0.197 | 1.295 ± 0.169 | 3.014 ± 0.262 | 1.147 ± 0.135 | 1.516 ± 0.182 | 1.905 ± 0.161 | 2.719 ± 0.204 | 2.127 ± 0.199 | 2.885 ± 0.25 | 0.61 ± 0.104 | 1.498 ± 0.171 | 0.0 ± 0.0
Gly | 3.865 ± 0.293 | 0.647 ± 0.103 | 4.254 ± 0.319 | 4.18 ± 0.233 | 3.07 ± 0.272 | 3.828 ± 0.313 | 0.98 ± 0.152 | 3.606 ± 0.274 | 4.494 ± 0.321 | 4.882 ± 0.341 | 1.794 ± 0.177 | 2.996 ± 0.322 | 1.59 ± 0.183 | 2.33 ± 0.227 | 2.867 ± 0.231 | 3.995 ± 0.257 | 4.069 ± 0.282 | 4.235 ± 0.294 | 0.943 ± 0.136 | 2.83 ± 0.195 | 0.0 ± 0.0
His | 0.943 ± 0.152 | 0.314 ± 0.077 | 1.369 ± 0.144 | 1.276 ± 0.149 | 0.98 ± 0.142 | 1.35 ± 0.162 | 0.37 ± 0.088 | 1.369 ± 0.188 | 1.48 ± 0.197 | 1.646 ± 0.189 | 0.536 ± 0.097 | 0.795 ± 0.12 | 0.832 ± 0.118 | 0.61 ± 0.101 | 0.666 ± 0.132 | 1.11 ± 0.142 | 1.11 ± 0.162 | 1.202 ± 0.141 | 0.37 ± 0.094 | 0.758 ± 0.117 | 0.0 ± 0.0
Ile | 4.66 ± 0.268 | 0.795 ± 0.109 | 4.901 ± 0.275 | 5.659 ± 0.303 | 2.256 ± 0.187 | 3.81 ± 0.291 | 1.424 ± 0.187 | 4.439 ± 0.303 | 6.177 ± 0.339 | 4.291 ± 0.3 | 2.127 ± 0.206 | 4.697 ± 0.255 | 3.255 ± 0.194 | 2.404 ± 0.21 | 3.865 ± 0.267 | 4.642 ± 0.303 | 4.513 ± 0.272 | 4.032 ± 0.278 | 0.869 ± 0.145 | 2.7 ± 0.216 | 0.0 ± 0.0
Lys | 5.844 ± 0.373 | 0.795 ± 0.12 | 4.827 ± 0.333 | 5.215 ± 0.354 | 3.532 ± 0.293 | 4.013 ± 0.244 | 1.59 ± 0.208 | 5.419 ± 0.298 | 4.679 ± 0.301 | 6.029 ± 0.349 | 2.867 ± 0.237 | 3.995 ± 0.255 | 2.552 ± 0.223 | 2.497 ± 0.221 | 3.366 ± 0.288 | 4.605 ± 0.315 | 4.18 ± 0.282 | 4.623 ± 0.284 | 1.406 ± 0.183 | 2.996 ± 0.241 | 0.0 ± 0.0
Leu | 5.049 ± 0.311 | 0.943 ± 0.145 | 5.456 ± 0.264 | 5.511 ± 0.347 | 3.421 ± 0.268 | 4.402 ± 0.293 | 1.128 ± 0.126 | 5.382 ± 0.344 | 5.863 ± 0.354 | 4.383 ± 0.352 | 2.626 ± 0.258 | 4.309 ± 0.279 | 3.236 ± 0.252 | 2.423 ± 0.225 | 3.828 ± 0.265 | 4.642 ± 0.252 | 4.217 ± 0.32 | 4.568 ± 0.295 | 0.703 ± 0.111 | 2.626 ± 0.236 | 0.0 ± 0.0
Met | 2.164 ± 0.199 | 0.296 ± 0.082 | 1.701 ± 0.175 | 1.942 ± 0.209 | 1.48 ± 0.152 | 1.35 ± 0.164 | 0.462 ± 0.084 | 2.182 ± 0.215 | 3.181 ± 0.213 | 2.312 ± 0.181 | 0.962 ± 0.141 | 2.016 ± 0.191 | 0.666 ± 0.117 | 1.221 ± 0.161 | 1.073 ± 0.15 | 2.127 ± 0.221 | 1.979 ± 0.195 | 1.406 ± 0.15 | 0.24 ± 0.07 | 1.202 ± 0.138 | 0.0 ± 0.0
Asn | 3.606 ± 0.256 | 0.869 ± 0.13 | 3.31 ± 0.21 | 4.272 ± 0.206 | 2.367 ± 0.199 | 3.865 ± 0.248 | 0.962 ± 0.152 | 3.643 ± 0.275 | 3.736 ± 0.232 | 3.717 ± 0.315 | 1.424 ± 0.156 | 3.033 ± 0.228 | 2.423 ± 0.223 | 1.905 ± 0.19 | 2.774 ± 0.22 | 3.31 ± 0.258 | 2.608 ± 0.207 | 3.107 ± 0.234 | 0.536 ± 0.104 | 2.256 ± 0.219 | 0.0 ± 0.0
Pro | 2.238 ± 0.2 | 0.407 ± 0.086 | 2.33 ± 0.204 | 3.033 ± 0.262 | 1.369 ± 0.143 | 2.608 ± 0.229 | 0.925 ± 0.122 | 2.127 ± 0.212 | 2.293 ± 0.194 | 2.108 ± 0.203 | 0.832 ± 0.13 | 1.942 ± 0.213 | 1.054 ± 0.135 | 0.777 ± 0.113 | 1.48 ± 0.172 | 2.145 ± 0.178 | 2.127 ± 0.212 | 2.663 ± 0.217 | 0.703 ± 0.131 | 1.332 ± 0.164 | 0.0 ± 0.0
Gln | 1.868 ± 0.213 | 0.277 ± 0.07 | 1.738 ± 0.165 | 2.275 ± 0.192 | 1.831 ± 0.175 | 1.757 ± 0.186 | 0.629 ± 0.126 | 2.589 ± 0.219 | 2.441 ± 0.214 | 3.144 ± 0.243 | 1.128 ± 0.128 | 1.59 ± 0.175 | 1.239 ± 0.173 | 1.11 ± 0.175 | 1.794 ± 0.177 | 2.016 ± 0.191 | 2.127 ± 0.212 | 2.404 ± 0.225 | 0.666 ± 0.117 | 1.646 ± 0.194 | 0.0 ± 0.0
Arg | 3.088 ± 0.244 | 0.555 ± 0.105 | 2.885 ± 0.176 | 3.421 ± 0.281 | 2.09 ± 0.197 | 2.867 ± 0.209 | 1.054 ± 0.133 | 3.699 ± 0.293 | 3.255 ± 0.269 | 3.44 ± 0.264 | 1.387 ± 0.158 | 2.219 ± 0.228 | 1.258 ± 0.145 | 1.738 ± 0.188 | 2.256 ± 0.211 | 2.811 ± 0.233 | 2.293 ± 0.215 | 3.125 ± 0.241 | 0.684 ± 0.122 | 1.812 ± 0.173 | 0.0 ± 0.0
Ser | 3.773 ± 0.263 | 0.61 ± 0.099 | 3.662 ± 0.255 | 4.05 ± 0.252 | 2.756 ± 0.201 | 4.623 ± 0.289 | 1.11 ± 0.146 | 4.365 ± 0.327 | 5.03 ± 0.314 | 4.568 ± 0.292 | 1.72 ± 0.169 | 3.347 ± 0.275 | 1.831 ± 0.216 | 2.164 ± 0.192 | 2.904 ± 0.249 | 4.032 ± 0.308 | 3.81 ± 0.294 | 4.198 ± 0.3 | 0.906 ± 0.111 | 2.645 ± 0.303 | 0.0 ± 0.0
Thr | 3.81 ± 0.262 | 0.444 ± 0.083 | 3.144 ± 0.249 | 3.773 ± 0.331 | 2.663 ± 0.222 | 4.291 ± 0.292 | 1.276 ± 0.152 | 4.254 ± 0.295 | 3.754 ± 0.232 | 4.013 ± 0.288 | 1.295 ± 0.16 | 2.83 ± 0.216 | 2.867 ± 0.286 | 1.868 ± 0.171 | 2.904 ± 0.196 | 3.144 ± 0.292 | 3.366 ± 0.294 | 3.68 ± 0.302 | 0.869 ± 0.119 | 2.201 ± 0.204 | 0.0 ± 0.0
Val | 4.346 ± 0.305 | 0.592 ± 0.092 | 4.291 ± 0.293 | 5.086 ± 0.302 | 2.589 ± 0.219 | 3.088 ± 0.246 | 1.258 ± 0.154 | 4.808 ± 0.262 | 5.215 ± 0.368 | 4.457 ± 0.27 | 1.775 ± 0.183 | 3.421 ± 0.238 | 2.182 ± 0.22 | 2.312 ± 0.233 | 3.07 ± 0.21 | 4.457 ± 0.263 | 3.514 ± 0.304 | 4.494 ± 0.333 | 0.832 ± 0.117 | 2.959 ± 0.216 | 0.0 ± 0.0
Trp | 1.017 ± 0.148 | 0.148 ± 0.06 | 1.036 ± 0.129 | 1.165 ± 0.129 | 0.703 ± 0.12 | 0.573 ± 0.114 | 0.259 ± 0.082 | 0.869 ± 0.132 | 1.295 ± 0.156 | 1.239 ± 0.143 | 0.407 ± 0.08 | 0.74 ± 0.122 | 0.388 ± 0.093 | 0.684 ± 0.118 | 0.573 ± 0.085 | 0.74 ± 0.108 | 0.832 ± 0.118 | 1.036 ± 0.124 | 0.296 ± 0.081 | 0.758 ± 0.126 | 0.0 ± 0.0
Tyr | 2.885 ± 0.253 | 0.555 ± 0.1 | 3.292 ± 0.267 | 2.682 ± 0.245 | 1.831 ± 0.173 | 2.922 ± 0.243 | 0.888 ± 0.122 | 2.959 ± 0.226 | 2.793 ± 0.271 | 2.996 ± 0.297 | 1.128 ± 0.129 | 2.275 ± 0.211 | 1.406 ± 0.132 | 1.535 ± 0.185 | 1.664 ± 0.181 | 2.645 ± 0.208 | 2.145 ± 0.189 | 2.811 ± 0.213 | 0.481 ± 0.082 | 1.683 ± 0.208 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 298 proteins (54073 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
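A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the estimates is reported. This is an illustration under stated assumptions, not the database's code. The function names, the fixed seed, the toy sequences, and the use of the sample standard deviation as the error are all assumptions; the page does not say which spread measure it reports.

```python
import random
import statistics
from collections import Counter

def pair_frequency(proteins, pair):
    """Per mille frequency of one dipeptide across a list of sequences."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the set size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)

toy_proteome = ["MKKLAE", "MAKKE", "MLLKKA", "MEELK"]  # illustrative only
error = bootstrap_error(toy_proteome, "KK")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the dipeptides inside a protein stay together in every replicate.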

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski