Amino acid dipepetide frequency for Salmonella phage SPAsTU

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.364AlaAla: 7.364 ± 0.476
0.706AlaCys: 0.706 ± 0.111
5.144AlaAsp: 5.144 ± 0.283
5.183AlaGlu: 5.183 ± 0.307
3.066AlaPhe: 3.066 ± 0.203
4.875AlaGly: 4.875 ± 0.312
1.36AlaHis: 1.36 ± 0.131
4.708AlaIle: 4.708 ± 0.26
4.182AlaLys: 4.182 ± 0.294
7.274AlaLeu: 7.274 ± 0.293
2.437AlaMet: 2.437 ± 0.168
3.515AlaAsn: 3.515 ± 0.203
2.938AlaPro: 2.938 ± 0.172
2.938AlaGln: 2.938 ± 0.195
3.72AlaArg: 3.72 ± 0.244
3.951AlaSer: 3.951 ± 0.278
4.593AlaThr: 4.593 ± 0.288
5.619AlaVal: 5.619 ± 0.251
1.155AlaTrp: 1.155 ± 0.132
3.002AlaTyr: 3.002 ± 0.217
0.0AlaXaa: 0.0 ± 0.0
Cys
0.59CysAla: 0.59 ± 0.094
0.141CysCys: 0.141 ± 0.039
0.564CysAsp: 0.564 ± 0.086
0.616CysGlu: 0.616 ± 0.103
0.295CysPhe: 0.295 ± 0.061
0.641CysGly: 0.641 ± 0.112
0.321CysHis: 0.321 ± 0.066
0.552CysIle: 0.552 ± 0.074
0.449CysLys: 0.449 ± 0.076
0.808CysLeu: 0.808 ± 0.095
0.244CysMet: 0.244 ± 0.057
0.372CysAsn: 0.372 ± 0.078
0.423CysPro: 0.423 ± 0.077
0.359CysGln: 0.359 ± 0.077
0.603CysArg: 0.603 ± 0.089
0.59CysSer: 0.59 ± 0.08
0.59CysThr: 0.59 ± 0.098
0.718CysVal: 0.718 ± 0.092
0.077CysTrp: 0.077 ± 0.028
0.411CysTyr: 0.411 ± 0.082
0.0CysXaa: 0.0 ± 0.0
Asp
5.337AspAla: 5.337 ± 0.276
0.59AspCys: 0.59 ± 0.102
4.503AspAsp: 4.503 ± 0.269
3.925AspGlu: 3.925 ± 0.237
2.604AspPhe: 2.604 ± 0.194
4.811AspGly: 4.811 ± 0.288
1.283AspHis: 1.283 ± 0.123
4.131AspIle: 4.131 ± 0.231
3.284AspLys: 3.284 ± 0.237
6.196AspLeu: 6.196 ± 0.299
1.693AspMet: 1.693 ± 0.128
2.848AspAsn: 2.848 ± 0.202
3.053AspPro: 3.053 ± 0.197
1.77AspGln: 1.77 ± 0.143
3.13AspArg: 3.13 ± 0.193
3.079AspSer: 3.079 ± 0.178
3.874AspThr: 3.874 ± 0.206
4.593AspVal: 4.593 ± 0.279
1.065AspTrp: 1.065 ± 0.117
2.835AspTyr: 2.835 ± 0.18
0.0AspXaa: 0.0 ± 0.0
Glu
4.464GluAla: 4.464 ± 0.274
0.552GluCys: 0.552 ± 0.079
3.861GluAsp: 3.861 ± 0.222
4.31GluGlu: 4.31 ± 0.378
2.745GluPhe: 2.745 ± 0.197
4.131GluGly: 4.131 ± 0.223
1.373GluHis: 1.373 ± 0.126
3.836GluIle: 3.836 ± 0.238
3.335GluLys: 3.335 ± 0.23
6.812GluLeu: 6.812 ± 0.322
2.001GluMet: 2.001 ± 0.133
2.489GluAsn: 2.489 ± 0.196
2.232GluPro: 2.232 ± 0.158
2.348GluGln: 2.348 ± 0.196
3.746GluArg: 3.746 ± 0.265
3.002GluSer: 3.002 ± 0.197
3.861GluThr: 3.861 ± 0.216
4.028GluVal: 4.028 ± 0.223
1.129GluTrp: 1.129 ± 0.106
2.604GluTyr: 2.604 ± 0.178
0.0GluXaa: 0.0 ± 0.0
Phe
2.899PheAla: 2.899 ± 0.177
0.321PheCys: 0.321 ± 0.057
2.963PheAsp: 2.963 ± 0.2
2.553PheGlu: 2.553 ± 0.195
1.629PhePhe: 1.629 ± 0.149
2.527PheGly: 2.527 ± 0.172
0.693PheHis: 0.693 ± 0.102
1.976PheIle: 1.976 ± 0.165
1.899PheLys: 1.899 ± 0.166
2.899PheLeu: 2.899 ± 0.198
1.167PheMet: 1.167 ± 0.105
2.604PheAsn: 2.604 ± 0.161
1.899PhePro: 1.899 ± 0.177
1.18PheGln: 1.18 ± 0.118
2.117PheArg: 2.117 ± 0.156
2.797PheSer: 2.797 ± 0.183
2.925PheThr: 2.925 ± 0.215
2.425PheVal: 2.425 ± 0.187
0.436PheTrp: 0.436 ± 0.074
1.822PheTyr: 1.822 ± 0.134
0.0PheXaa: 0.0 ± 0.0
Gly
4.208GlyAla: 4.208 ± 0.278
0.526GlyCys: 0.526 ± 0.095
3.849GlyAsp: 3.849 ± 0.255
4.541GlyGlu: 4.541 ± 0.277
3.079GlyPhe: 3.079 ± 0.191
4.439GlyGly: 4.439 ± 0.437
1.078GlyHis: 1.078 ± 0.113
3.99GlyIle: 3.99 ± 0.223
3.925GlyLys: 3.925 ± 0.251
5.76GlyLeu: 5.76 ± 0.262
1.937GlyMet: 1.937 ± 0.164
3.605GlyAsn: 3.605 ± 0.198
1.527GlyPro: 1.527 ± 0.156
2.463GlyGln: 2.463 ± 0.173
3.733GlyArg: 3.733 ± 0.249
3.99GlySer: 3.99 ± 0.313
4.374GlyThr: 4.374 ± 0.28
4.952GlyVal: 4.952 ± 0.262
1.206GlyTrp: 1.206 ± 0.121
2.745GlyTyr: 2.745 ± 0.174
0.0GlyXaa: 0.0 ± 0.0
His
1.257HisAla: 1.257 ± 0.116
0.244HisCys: 0.244 ± 0.051
1.27HisAsp: 1.27 ± 0.122
1.142HisGlu: 1.142 ± 0.133
0.988HisPhe: 0.988 ± 0.121
1.244HisGly: 1.244 ± 0.129
0.616HisHis: 0.616 ± 0.094
1.193HisIle: 1.193 ± 0.126
0.693HisLys: 0.693 ± 0.091
1.642HisLeu: 1.642 ± 0.17
0.526HisMet: 0.526 ± 0.084
0.706HisAsn: 0.706 ± 0.088
1.232HisPro: 1.232 ± 0.141
0.693HisGln: 0.693 ± 0.094
1.296HisArg: 1.296 ± 0.132
0.911HisSer: 0.911 ± 0.115
1.206HisThr: 1.206 ± 0.138
1.296HisVal: 1.296 ± 0.105
0.398HisTrp: 0.398 ± 0.074
1.27HisTyr: 1.27 ± 0.149
0.0HisXaa: 0.0 ± 0.0
Ile
4.67IleAla: 4.67 ± 0.282
0.603IleCys: 0.603 ± 0.089
4.015IleAsp: 4.015 ± 0.228
3.964IleGlu: 3.964 ± 0.233
1.604IlePhe: 1.604 ± 0.121
3.464IleGly: 3.464 ± 0.183
1.078IleHis: 1.078 ± 0.11
2.899IleIle: 2.899 ± 0.189
2.797IleLys: 2.797 ± 0.202
4.169IleLeu: 4.169 ± 0.235
1.257IleMet: 1.257 ± 0.131
3.117IleAsn: 3.117 ± 0.218
2.925IlePro: 2.925 ± 0.205
1.899IleGln: 1.899 ± 0.15
3.489IleArg: 3.489 ± 0.226
3.374IleSer: 3.374 ± 0.204
4.233IleThr: 4.233 ± 0.242
3.387IleVal: 3.387 ± 0.207
0.475IleTrp: 0.475 ± 0.085
2.181IleTyr: 2.181 ± 0.152
0.0IleXaa: 0.0 ± 0.0
Lys
4.233LysAla: 4.233 ± 0.293
0.398LysCys: 0.398 ± 0.068
3.246LysAsp: 3.246 ± 0.209
3.515LysGlu: 3.515 ± 0.248
2.027LysPhe: 2.027 ± 0.158
3.207LysGly: 3.207 ± 0.247
1.078LysHis: 1.078 ± 0.119
2.463LysIle: 2.463 ± 0.152
2.835LysLys: 2.835 ± 0.204
5.452LysLeu: 5.452 ± 0.278
1.911LysMet: 1.911 ± 0.156
2.283LysAsn: 2.283 ± 0.184
2.655LysPro: 2.655 ± 0.21
2.155LysGln: 2.155 ± 0.174
3.04LysArg: 3.04 ± 0.226
2.553LysSer: 2.553 ± 0.214
2.989LysThr: 2.989 ± 0.182
3.977LysVal: 3.977 ± 0.241
0.706LysTrp: 0.706 ± 0.094
2.001LysTyr: 2.001 ± 0.175
0.0LysXaa: 0.0 ± 0.0
Leu
7.453LeuAla: 7.453 ± 0.323
0.834LeuCys: 0.834 ± 0.134
5.991LeuAsp: 5.991 ± 0.277
5.106LeuGlu: 5.106 ± 0.288
3.31LeuPhe: 3.31 ± 0.197
5.606LeuGly: 5.606 ± 0.318
1.899LeuHis: 1.899 ± 0.215
4.362LeuIle: 4.362 ± 0.242
5.003LeuLys: 5.003 ± 0.203
7.479LeuLeu: 7.479 ± 0.355
2.437LeuMet: 2.437 ± 0.191
5.093LeuAsn: 5.093 ± 0.257
4.952LeuPro: 4.952 ± 0.234
3.323LeuGln: 3.323 ± 0.214
5.067LeuArg: 5.067 ± 0.243
5.914LeuSer: 5.914 ± 0.329
6.466LeuThr: 6.466 ± 0.282
5.644LeuVal: 5.644 ± 0.254
1.052LeuTrp: 1.052 ± 0.144
3.323LeuTyr: 3.323 ± 0.241
0.0LeuXaa: 0.0 ± 0.0
Met
2.194MetAla: 2.194 ± 0.21
0.244MetCys: 0.244 ± 0.051
1.411MetAsp: 1.411 ± 0.111
1.527MetGlu: 1.527 ± 0.146
1.193MetPhe: 1.193 ± 0.122
1.719MetGly: 1.719 ± 0.157
0.513MetHis: 0.513 ± 0.076
1.283MetIle: 1.283 ± 0.127
1.604MetLys: 1.604 ± 0.133
2.694MetLeu: 2.694 ± 0.192
0.86MetMet: 0.86 ± 0.101
1.437MetAsn: 1.437 ± 0.126
1.193MetPro: 1.193 ± 0.131
1.142MetGln: 1.142 ± 0.115
1.706MetArg: 1.706 ± 0.163
2.04MetSer: 2.04 ± 0.183
1.809MetThr: 1.809 ± 0.142
1.899MetVal: 1.899 ± 0.158
0.321MetTrp: 0.321 ± 0.057
1.039MetTyr: 1.039 ± 0.117
0.0MetXaa: 0.0 ± 0.0
Asn
4.31AsnAla: 4.31 ± 0.259
0.411AsnCys: 0.411 ± 0.076
2.874AsnAsp: 2.874 ± 0.187
2.912AsnGlu: 2.912 ± 0.21
1.757AsnPhe: 1.757 ± 0.143
4.285AsnGly: 4.285 ± 0.252
0.744AsnHis: 0.744 ± 0.092
2.54AsnIle: 2.54 ± 0.183
2.386AsnLys: 2.386 ± 0.153
4.182AsnLeu: 4.182 ± 0.236
0.911AsnMet: 0.911 ± 0.106
2.425AsnAsn: 2.425 ± 0.177
2.925AsnPro: 2.925 ± 0.218
2.001AsnGln: 2.001 ± 0.199
2.527AsnArg: 2.527 ± 0.166
2.553AsnSer: 2.553 ± 0.178
3.194AsnThr: 3.194 ± 0.214
3.669AsnVal: 3.669 ± 0.208
0.808AsnTrp: 0.808 ± 0.107
1.976AsnTyr: 1.976 ± 0.158
0.0AsnXaa: 0.0 ± 0.0
Pro
3.528ProAla: 3.528 ± 0.222
0.167ProCys: 0.167 ± 0.039
3.476ProAsp: 3.476 ± 0.272
3.618ProGlu: 3.618 ± 0.249
1.706ProPhe: 1.706 ± 0.161
2.732ProGly: 2.732 ± 0.193
0.744ProHis: 0.744 ± 0.093
2.476ProIle: 2.476 ± 0.2
2.309ProLys: 2.309 ± 0.19
3.81ProLeu: 3.81 ± 0.22
1.026ProMet: 1.026 ± 0.106
2.335ProAsn: 2.335 ± 0.163
1.668ProPro: 1.668 ± 0.139
1.488ProGln: 1.488 ± 0.151
2.168ProArg: 2.168 ± 0.176
2.694ProSer: 2.694 ± 0.211
3.104ProThr: 3.104 ± 0.179
3.759ProVal: 3.759 ± 0.249
0.641ProTrp: 0.641 ± 0.09
1.757ProTyr: 1.757 ± 0.159
0.0ProXaa: 0.0 ± 0.0
Gln
3.066GlnAla: 3.066 ± 0.233
0.449GlnCys: 0.449 ± 0.086
1.809GlnAsp: 1.809 ± 0.138
2.104GlnGlu: 2.104 ± 0.2
1.578GlnPhe: 1.578 ± 0.142
2.04GlnGly: 2.04 ± 0.152
0.783GlnHis: 0.783 ± 0.104
1.95GlnIle: 1.95 ± 0.166
1.822GlnLys: 1.822 ± 0.161
3.759GlnLeu: 3.759 ± 0.203
1.321GlnMet: 1.321 ± 0.122
1.398GlnAsn: 1.398 ± 0.135
1.642GlnPro: 1.642 ± 0.144
2.014GlnGln: 2.014 ± 0.176
2.643GlnArg: 2.643 ± 0.198
1.873GlnSer: 1.873 ± 0.132
2.091GlnThr: 2.091 ± 0.156
2.309GlnVal: 2.309 ± 0.166
0.603GlnTrp: 0.603 ± 0.079
1.514GlnTyr: 1.514 ± 0.14
0.0GlnXaa: 0.0 ± 0.0
Arg
3.374ArgAla: 3.374 ± 0.221
0.693ArgCys: 0.693 ± 0.097
3.502ArgAsp: 3.502 ± 0.28
3.4ArgGlu: 3.4 ± 0.244
2.579ArgPhe: 2.579 ± 0.194
3.374ArgGly: 3.374 ± 0.225
1.103ArgHis: 1.103 ± 0.107
3.4ArgIle: 3.4 ± 0.223
3.412ArgLys: 3.412 ± 0.219
5.234ArgLeu: 5.234 ± 0.267
1.616ArgMet: 1.616 ± 0.146
2.899ArgAsn: 2.899 ± 0.181
2.04ArgPro: 2.04 ± 0.201
2.206ArgGln: 2.206 ± 0.186
3.348ArgArg: 3.348 ± 0.245
2.694ArgSer: 2.694 ± 0.194
3.04ArgThr: 3.04 ± 0.22
3.695ArgVal: 3.695 ± 0.216
1.206ArgTrp: 1.206 ± 0.14
2.425ArgTyr: 2.425 ± 0.178
0.0ArgXaa: 0.0 ± 0.0
Ser
4.195SerAla: 4.195 ± 0.255
0.526SerCys: 0.526 ± 0.099
3.117SerAsp: 3.117 ± 0.216
3.156SerGlu: 3.156 ± 0.226
2.245SerPhe: 2.245 ± 0.161
3.99SerGly: 3.99 ± 0.245
1.09SerHis: 1.09 ± 0.12
3.31SerIle: 3.31 ± 0.205
3.015SerLys: 3.015 ± 0.221
5.093SerLeu: 5.093 ± 0.264
1.539SerMet: 1.539 ± 0.137
2.655SerAsn: 2.655 ± 0.185
2.591SerPro: 2.591 ± 0.209
2.065SerGln: 2.065 ± 0.195
2.976SerArg: 2.976 ± 0.181
3.412SerSer: 3.412 ± 0.197
3.579SerThr: 3.579 ± 0.224
4.041SerVal: 4.041 ± 0.231
0.808SerTrp: 0.808 ± 0.095
2.091SerTyr: 2.091 ± 0.161
0.0SerXaa: 0.0 ± 0.0
Thr
4.888ThrAla: 4.888 ± 0.244
0.564ThrCys: 0.564 ± 0.091
4.054ThrAsp: 4.054 ± 0.191
3.592ThrGlu: 3.592 ± 0.217
2.694ThrPhe: 2.694 ± 0.236
4.734ThrGly: 4.734 ± 0.265
1.334ThrHis: 1.334 ± 0.125
3.528ThrIle: 3.528 ± 0.216
3.028ThrLys: 3.028 ± 0.183
6.504ThrLeu: 6.504 ± 0.294
1.552ThrMet: 1.552 ± 0.14
3.104ThrAsn: 3.104 ± 0.212
3.746ThrPro: 3.746 ± 0.231
2.245ThrGln: 2.245 ± 0.175
2.797ThrArg: 2.797 ± 0.182
3.284ThrSer: 3.284 ± 0.208
4.644ThrThr: 4.644 ± 0.396
5.324ThrVal: 5.324 ± 0.292
1.052ThrTrp: 1.052 ± 0.127
2.322ThrTyr: 2.322 ± 0.153
0.0ThrXaa: 0.0 ± 0.0
Val
5.965ValAla: 5.965 ± 0.298
0.718ValCys: 0.718 ± 0.098
4.965ValAsp: 4.965 ± 0.252
4.772ValGlu: 4.772 ± 0.25
2.514ValPhe: 2.514 ± 0.168
4.259ValGly: 4.259 ± 0.244
1.373ValHis: 1.373 ± 0.13
4.118ValIle: 4.118 ± 0.187
4.067ValLys: 4.067 ± 0.215
5.439ValLeu: 5.439 ± 0.302
1.591ValMet: 1.591 ± 0.123
3.515ValAsn: 3.515 ± 0.197
3.258ValPro: 3.258 ± 0.195
2.219ValGln: 2.219 ± 0.189
3.656ValArg: 3.656 ± 0.234
3.887ValSer: 3.887 ± 0.253
4.965ValThr: 4.965 ± 0.331
5.747ValVal: 5.747 ± 0.32
1.078ValTrp: 1.078 ± 0.102
2.668ValTyr: 2.668 ± 0.185
0.0ValXaa: 0.0 ± 0.0
Trp
1.167TrpAla: 1.167 ± 0.109
0.154TrpCys: 0.154 ± 0.044
0.988TrpAsp: 0.988 ± 0.121
0.885TrpGlu: 0.885 ± 0.096
0.616TrpPhe: 0.616 ± 0.081
0.808TrpGly: 0.808 ± 0.096
0.321TrpHis: 0.321 ± 0.072
0.783TrpIle: 0.783 ± 0.093
0.975TrpLys: 0.975 ± 0.1
1.539TrpLeu: 1.539 ± 0.132
0.487TrpMet: 0.487 ± 0.08
0.718TrpAsn: 0.718 ± 0.096
0.462TrpPro: 0.462 ± 0.081
0.552TrpGln: 0.552 ± 0.089
1.001TrpArg: 1.001 ± 0.101
0.924TrpSer: 0.924 ± 0.103
0.834TrpThr: 0.834 ± 0.125
0.988TrpVal: 0.988 ± 0.103
0.282TrpTrp: 0.282 ± 0.069
0.577TrpTyr: 0.577 ± 0.084
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.566TyrAla: 2.566 ± 0.169
0.526TyrCys: 0.526 ± 0.096
3.002TyrAsp: 3.002 ± 0.219
2.04TyrGlu: 2.04 ± 0.181
1.501TyrPhe: 1.501 ± 0.146
3.04TyrGly: 3.04 ± 0.178
1.039TyrHis: 1.039 ± 0.115
2.104TyrIle: 2.104 ± 0.15
1.706TyrLys: 1.706 ± 0.161
3.566TyrLeu: 3.566 ± 0.214
1.296TyrMet: 1.296 ± 0.14
2.271TyrAsn: 2.271 ± 0.2
1.847TyrPro: 1.847 ± 0.172
1.719TyrGln: 1.719 ± 0.15
2.476TyrArg: 2.476 ± 0.18
1.976TyrSer: 1.976 ± 0.168
2.617TyrThr: 2.617 ± 0.194
2.707TyrVal: 2.707 ± 0.214
0.564TyrTrp: 0.564 ± 0.088
2.027TyrTyr: 2.027 ± 0.173
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 287 proteins (77953 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski