Amino acid dipepetide frequency for Salmonella phage Det7

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.228AlaAla: 5.228 ± 0.378
0.738AlaCys: 0.738 ± 0.146
3.998AlaAsp: 3.998 ± 0.302
4.08AlaGlu: 4.08 ± 0.403
2.522AlaPhe: 2.522 ± 0.229
4.203AlaGly: 4.203 ± 0.355
1.456AlaHis: 1.456 ± 0.177
4.326AlaIle: 4.326 ± 0.297
3.977AlaLys: 3.977 ± 0.303
5.33AlaLeu: 5.33 ± 0.359
2.009AlaMet: 2.009 ± 0.221
3.403AlaAsn: 3.403 ± 0.242
2.419AlaPro: 2.419 ± 0.261
2.358AlaGln: 2.358 ± 0.239
3.424AlaArg: 3.424 ± 0.254
4.162AlaSer: 4.162 ± 0.272
4.203AlaThr: 4.203 ± 0.311
4.9AlaVal: 4.9 ± 0.294
0.779AlaTrp: 0.779 ± 0.146
2.501AlaTyr: 2.501 ± 0.244
0.0AlaXaa: 0.0 ± 0.0
Cys
0.677CysAla: 0.677 ± 0.1
0.164CysCys: 0.164 ± 0.069
0.82CysAsp: 0.82 ± 0.152
0.902CysGlu: 0.902 ± 0.143
0.369CysPhe: 0.369 ± 0.094
0.8CysGly: 0.8 ± 0.149
0.472CysHis: 0.472 ± 0.11
0.759CysIle: 0.759 ± 0.119
0.902CysLys: 0.902 ± 0.127
0.697CysLeu: 0.697 ± 0.142
0.349CysMet: 0.349 ± 0.076
0.595CysAsn: 0.595 ± 0.112
0.574CysPro: 0.574 ± 0.104
0.267CysGln: 0.267 ± 0.068
0.451CysArg: 0.451 ± 0.096
0.697CysSer: 0.697 ± 0.102
0.677CysThr: 0.677 ± 0.126
0.984CysVal: 0.984 ± 0.151
0.144CysTrp: 0.144 ± 0.053
0.39CysTyr: 0.39 ± 0.114
0.0CysXaa: 0.0 ± 0.0
Asp
4.326AspAla: 4.326 ± 0.316
0.779AspCys: 0.779 ± 0.124
3.772AspAsp: 3.772 ± 0.291
3.936AspGlu: 3.936 ± 0.337
3.116AspPhe: 3.116 ± 0.252
4.982AspGly: 4.982 ± 0.349
0.943AspHis: 0.943 ± 0.15
4.408AspIle: 4.408 ± 0.294
3.67AspLys: 3.67 ± 0.252
5.987AspLeu: 5.987 ± 0.355
2.05AspMet: 2.05 ± 0.206
2.932AspAsn: 2.932 ± 0.241
2.768AspPro: 2.768 ± 0.249
2.05AspGln: 2.05 ± 0.179
2.132AspArg: 2.132 ± 0.224
3.957AspSer: 3.957 ± 0.281
3.219AspThr: 3.219 ± 0.316
4.08AspVal: 4.08 ± 0.249
0.923AspTrp: 0.923 ± 0.149
3.116AspTyr: 3.116 ± 0.264
0.0AspXaa: 0.0 ± 0.0
Glu
4.367GluAla: 4.367 ± 0.386
0.779GluCys: 0.779 ± 0.133
4.162GluAsp: 4.162 ± 0.331
4.367GluGlu: 4.367 ± 0.409
2.932GluPhe: 2.932 ± 0.262
4.285GluGly: 4.285 ± 0.237
1.415GluHis: 1.415 ± 0.188
4.223GluIle: 4.223 ± 0.272
3.506GluLys: 3.506 ± 0.353
6.335GluLeu: 6.335 ± 0.372
1.989GluMet: 1.989 ± 0.217
3.403GluAsn: 3.403 ± 0.224
2.091GluPro: 2.091 ± 0.222
2.706GluGln: 2.706 ± 0.251
3.608GluArg: 3.608 ± 0.333
3.998GluSer: 3.998 ± 0.302
3.506GluThr: 3.506 ± 0.229
4.387GluVal: 4.387 ± 0.305
1.066GluTrp: 1.066 ± 0.153
2.993GluTyr: 2.993 ± 0.264
0.0GluXaa: 0.0 ± 0.0
Phe
2.337PheAla: 2.337 ± 0.197
0.595PheCys: 0.595 ± 0.105
2.747PheAsp: 2.747 ± 0.243
3.157PheGlu: 3.157 ± 0.269
1.661PhePhe: 1.661 ± 0.162
3.198PheGly: 3.198 ± 0.281
0.964PheHis: 0.964 ± 0.16
2.727PheIle: 2.727 ± 0.213
2.788PheLys: 2.788 ± 0.24
2.85PheLeu: 2.85 ± 0.239
1.23PheMet: 1.23 ± 0.151
2.727PheAsn: 2.727 ± 0.223
1.517PhePro: 1.517 ± 0.207
1.579PheGln: 1.579 ± 0.166
2.276PheArg: 2.276 ± 0.244
2.952PheSer: 2.952 ± 0.215
2.706PheThr: 2.706 ± 0.245
3.014PheVal: 3.014 ± 0.219
0.738PheTrp: 0.738 ± 0.139
1.435PheTyr: 1.435 ± 0.153
0.0PheXaa: 0.0 ± 0.0
Gly
3.67GlyAla: 3.67 ± 0.281
0.902GlyCys: 0.902 ± 0.145
4.039GlyAsp: 4.039 ± 0.345
4.695GlyGlu: 4.695 ± 0.316
2.768GlyPhe: 2.768 ± 0.242
5.166GlyGly: 5.166 ± 0.465
1.148GlyHis: 1.148 ± 0.137
4.695GlyIle: 4.695 ± 0.303
5.248GlyLys: 5.248 ± 0.384
5.043GlyLeu: 5.043 ± 0.286
2.03GlyMet: 2.03 ± 0.202
3.485GlyAsn: 3.485 ± 0.377
1.148GlyPro: 1.148 ± 0.146
2.378GlyGln: 2.378 ± 0.231
2.911GlyArg: 2.911 ± 0.252
4.92GlySer: 4.92 ± 0.424
3.793GlyThr: 3.793 ± 0.324
5.125GlyVal: 5.125 ± 0.411
1.394GlyTrp: 1.394 ± 0.19
2.747GlyTyr: 2.747 ± 0.211
0.0GlyXaa: 0.0 ± 0.0
His
1.046HisAla: 1.046 ± 0.156
0.328HisCys: 0.328 ± 0.067
1.189HisAsp: 1.189 ± 0.155
0.759HisGlu: 0.759 ± 0.116
1.005HisPhe: 1.005 ± 0.156
0.943HisGly: 0.943 ± 0.14
0.554HisHis: 0.554 ± 0.104
1.579HisIle: 1.579 ± 0.192
1.271HisLys: 1.271 ± 0.176
1.804HisLeu: 1.804 ± 0.14
0.595HisMet: 0.595 ± 0.113
0.8HisAsn: 0.8 ± 0.117
0.984HisPro: 0.984 ± 0.174
0.533HisGln: 0.533 ± 0.083
1.066HisArg: 1.066 ± 0.135
1.128HisSer: 1.128 ± 0.164
1.087HisThr: 1.087 ± 0.185
1.415HisVal: 1.415 ± 0.17
0.226HisTrp: 0.226 ± 0.079
0.902HisTyr: 0.902 ± 0.148
0.0HisXaa: 0.0 ± 0.0
Ile
3.936IleAla: 3.936 ± 0.273
0.718IleCys: 0.718 ± 0.117
4.859IleAsp: 4.859 ± 0.332
4.715IleGlu: 4.715 ± 0.325
1.927IlePhe: 1.927 ± 0.197
3.916IleGly: 3.916 ± 0.275
1.374IleHis: 1.374 ± 0.198
3.649IleIle: 3.649 ± 0.335
4.059IleLys: 4.059 ± 0.272
4.408IleLeu: 4.408 ± 0.328
1.845IleMet: 1.845 ± 0.199
3.67IleAsn: 3.67 ± 0.335
3.096IlePro: 3.096 ± 0.242
2.604IleGln: 2.604 ± 0.264
3.157IleArg: 3.157 ± 0.262
3.854IleSer: 3.854 ± 0.329
4.551IleThr: 4.551 ± 0.358
4.1IleVal: 4.1 ± 0.36
0.779IleTrp: 0.779 ± 0.16
2.153IleTyr: 2.153 ± 0.216
0.0IleXaa: 0.0 ± 0.0
Lys
4.367LysAla: 4.367 ± 0.346
0.513LysCys: 0.513 ± 0.119
3.957LysAsp: 3.957 ± 0.282
4.572LysGlu: 4.572 ± 0.346
3.383LysPhe: 3.383 ± 0.239
3.977LysGly: 3.977 ± 0.393
1.025LysHis: 1.025 ± 0.186
4.018LysIle: 4.018 ± 0.252
4.182LysLys: 4.182 ± 0.384
5.146LysLeu: 5.146 ± 0.358
2.419LysMet: 2.419 ± 0.257
2.747LysAsn: 2.747 ± 0.249
2.645LysPro: 2.645 ± 0.239
2.973LysGln: 2.973 ± 0.237
3.075LysArg: 3.075 ± 0.326
4.285LysSer: 4.285 ± 0.288
3.854LysThr: 3.854 ± 0.241
4.182LysVal: 4.182 ± 0.293
1.046LysTrp: 1.046 ± 0.166
2.296LysTyr: 2.296 ± 0.217
0.0LysXaa: 0.0 ± 0.0
Leu
5.864LeuAla: 5.864 ± 0.369
0.882LeuCys: 0.882 ± 0.138
4.982LeuAsp: 4.982 ± 0.307
5.454LeuGlu: 5.454 ± 0.347
3.608LeuPhe: 3.608 ± 0.282
5.084LeuGly: 5.084 ± 0.302
1.292LeuHis: 1.292 ± 0.168
4.264LeuIle: 4.264 ± 0.296
6.089LeuLys: 6.089 ± 0.415
6.171LeuLeu: 6.171 ± 0.393
1.845LeuMet: 1.845 ± 0.195
4.654LeuAsn: 4.654 ± 0.295
3.506LeuPro: 3.506 ± 0.308
2.911LeuGln: 2.911 ± 0.25
3.916LeuArg: 3.916 ± 0.261
5.495LeuSer: 5.495 ± 0.293
5.023LeuThr: 5.023 ± 0.297
5.351LeuVal: 5.351 ± 0.336
0.759LeuTrp: 0.759 ± 0.162
3.26LeuTyr: 3.26 ± 0.237
0.0LeuXaa: 0.0 ± 0.0
Met
2.645MetAla: 2.645 ± 0.245
0.369MetCys: 0.369 ± 0.095
1.558MetAsp: 1.558 ± 0.171
1.599MetGlu: 1.599 ± 0.16
1.435MetPhe: 1.435 ± 0.166
1.476MetGly: 1.476 ± 0.153
0.39MetHis: 0.39 ± 0.084
1.702MetIle: 1.702 ± 0.157
2.46MetLys: 2.46 ± 0.226
2.583MetLeu: 2.583 ± 0.235
1.148MetMet: 1.148 ± 0.162
1.681MetAsn: 1.681 ± 0.227
1.148MetPro: 1.148 ± 0.156
0.943MetGln: 0.943 ± 0.14
1.763MetArg: 1.763 ± 0.191
2.112MetSer: 2.112 ± 0.189
1.722MetThr: 1.722 ± 0.193
1.702MetVal: 1.702 ± 0.183
0.246MetTrp: 0.246 ± 0.065
0.902MetTyr: 0.902 ± 0.147
0.0MetXaa: 0.0 ± 0.0
Asn
3.957AsnAla: 3.957 ± 0.317
0.656AsnCys: 0.656 ± 0.127
3.178AsnAsp: 3.178 ± 0.259
2.727AsnGlu: 2.727 ± 0.239
2.132AsnPhe: 2.132 ± 0.235
3.998AsnGly: 3.998 ± 0.323
1.189AsnHis: 1.189 ± 0.157
3.096AsnIle: 3.096 ± 0.232
3.301AsnLys: 3.301 ± 0.269
3.752AsnLeu: 3.752 ± 0.294
1.845AsnMet: 1.845 ± 0.189
3.321AsnAsn: 3.321 ± 0.339
2.542AsnPro: 2.542 ± 0.242
2.132AsnGln: 2.132 ± 0.219
2.686AsnArg: 2.686 ± 0.227
3.034AsnSer: 3.034 ± 0.229
2.768AsnThr: 2.768 ± 0.241
3.465AsnVal: 3.465 ± 0.298
0.738AsnTrp: 0.738 ± 0.127
1.886AsnTyr: 1.886 ± 0.214
0.0AsnXaa: 0.0 ± 0.0
Pro
2.46ProAla: 2.46 ± 0.202
0.492ProCys: 0.492 ± 0.097
2.891ProAsp: 2.891 ± 0.229
3.567ProGlu: 3.567 ± 0.333
1.64ProPhe: 1.64 ± 0.176
2.645ProGly: 2.645 ± 0.241
0.636ProHis: 0.636 ± 0.119
2.214ProIle: 2.214 ± 0.249
2.255ProLys: 2.255 ± 0.199
2.973ProLeu: 2.973 ± 0.252
0.902ProMet: 0.902 ± 0.12
1.825ProAsn: 1.825 ± 0.218
1.251ProPro: 1.251 ± 0.165
1.374ProGln: 1.374 ± 0.155
1.886ProArg: 1.886 ± 0.219
2.87ProSer: 2.87 ± 0.265
2.337ProThr: 2.337 ± 0.241
2.665ProVal: 2.665 ± 0.227
0.697ProTrp: 0.697 ± 0.113
1.353ProTyr: 1.353 ± 0.157
0.0ProXaa: 0.0 ± 0.0
Gln
2.583GlnAla: 2.583 ± 0.238
0.349GlnCys: 0.349 ± 0.094
2.03GlnAsp: 2.03 ± 0.212
2.378GlnGlu: 2.378 ± 0.245
1.845GlnPhe: 1.845 ± 0.221
2.153GlnGly: 2.153 ± 0.181
0.8GlnHis: 0.8 ± 0.14
2.768GlnIle: 2.768 ± 0.213
2.194GlnLys: 2.194 ± 0.197
3.096GlnLeu: 3.096 ± 0.283
1.066GlnMet: 1.066 ± 0.141
1.743GlnAsn: 1.743 ± 0.164
1.23GlnPro: 1.23 ± 0.139
1.886GlnGln: 1.886 ± 0.245
2.276GlnArg: 2.276 ± 0.194
2.44GlnSer: 2.44 ± 0.224
2.091GlnThr: 2.091 ± 0.175
2.624GlnVal: 2.624 ± 0.243
0.533GlnTrp: 0.533 ± 0.103
1.476GlnTyr: 1.476 ± 0.162
0.0GlnXaa: 0.0 ± 0.0
Arg
3.28ArgAla: 3.28 ± 0.226
0.636ArgCys: 0.636 ± 0.112
2.87ArgAsp: 2.87 ± 0.277
3.034ArgGlu: 3.034 ± 0.229
2.255ArgPhe: 2.255 ± 0.185
2.829ArgGly: 2.829 ± 0.217
1.046ArgHis: 1.046 ± 0.161
3.424ArgIle: 3.424 ± 0.22
3.055ArgLys: 3.055 ± 0.241
4.469ArgLeu: 4.469 ± 0.313
1.681ArgMet: 1.681 ± 0.207
2.563ArgAsn: 2.563 ± 0.244
1.661ArgPro: 1.661 ± 0.217
2.132ArgGln: 2.132 ± 0.19
3.137ArgArg: 3.137 ± 0.309
3.055ArgSer: 3.055 ± 0.258
2.296ArgThr: 2.296 ± 0.249
3.178ArgVal: 3.178 ± 0.272
0.8ArgTrp: 0.8 ± 0.105
2.214ArgTyr: 2.214 ± 0.192
0.0ArgXaa: 0.0 ± 0.0
Ser
3.895SerAla: 3.895 ± 0.348
0.636SerCys: 0.636 ± 0.148
3.854SerAsp: 3.854 ± 0.275
3.977SerGlu: 3.977 ± 0.296
2.768SerPhe: 2.768 ± 0.231
5.454SerGly: 5.454 ± 0.49
0.923SerHis: 0.923 ± 0.154
4.387SerIle: 4.387 ± 0.349
4.121SerLys: 4.121 ± 0.32
5.515SerLeu: 5.515 ± 0.338
1.825SerMet: 1.825 ± 0.198
3.608SerAsn: 3.608 ± 0.278
2.706SerPro: 2.706 ± 0.264
2.419SerGln: 2.419 ± 0.209
3.178SerArg: 3.178 ± 0.27
4.408SerSer: 4.408 ± 0.407
3.649SerThr: 3.649 ± 0.344
4.49SerVal: 4.49 ± 0.371
0.841SerTrp: 0.841 ± 0.129
2.399SerTyr: 2.399 ± 0.228
0.0SerXaa: 0.0 ± 0.0
Thr
3.916ThrAla: 3.916 ± 0.361
0.492ThrCys: 0.492 ± 0.089
3.567ThrAsp: 3.567 ± 0.268
3.813ThrGlu: 3.813 ± 0.35
2.645ThrPhe: 2.645 ± 0.243
4.428ThrGly: 4.428 ± 0.388
1.046ThrHis: 1.046 ± 0.156
3.977ThrIle: 3.977 ± 0.324
3.649ThrLys: 3.649 ± 0.271
4.674ThrLeu: 4.674 ± 0.33
1.374ThrMet: 1.374 ± 0.167
2.563ThrAsn: 2.563 ± 0.234
3.321ThrPro: 3.321 ± 0.241
2.03ThrGln: 2.03 ± 0.19
2.891ThrArg: 2.891 ± 0.237
3.547ThrSer: 3.547 ± 0.373
3.895ThrThr: 3.895 ± 0.32
4.408ThrVal: 4.408 ± 0.378
0.677ThrTrp: 0.677 ± 0.103
1.825ThrTyr: 1.825 ± 0.245
0.0ThrXaa: 0.0 ± 0.0
Val
3.977ValAla: 3.977 ± 0.264
0.82ValCys: 0.82 ± 0.153
4.92ValAsp: 4.92 ± 0.276
4.879ValGlu: 4.879 ± 0.349
2.645ValPhe: 2.645 ± 0.251
4.531ValGly: 4.531 ± 0.326
1.312ValHis: 1.312 ± 0.163
4.387ValIle: 4.387 ± 0.3
4.982ValLys: 4.982 ± 0.322
5.228ValLeu: 5.228 ± 0.285
1.784ValMet: 1.784 ± 0.17
3.567ValAsn: 3.567 ± 0.287
2.255ValPro: 2.255 ± 0.214
2.522ValGln: 2.522 ± 0.252
2.829ValArg: 2.829 ± 0.239
4.982ValSer: 4.982 ± 0.354
4.592ValThr: 4.592 ± 0.382
5.782ValVal: 5.782 ± 0.456
1.087ValTrp: 1.087 ± 0.18
3.055ValTyr: 3.055 ± 0.23
0.0ValXaa: 0.0 ± 0.0
Trp
0.964TrpAla: 0.964 ± 0.131
0.308TrpCys: 0.308 ± 0.088
1.128TrpAsp: 1.128 ± 0.176
1.169TrpGlu: 1.169 ± 0.174
0.779TrpPhe: 0.779 ± 0.122
0.718TrpGly: 0.718 ± 0.118
0.226TrpHis: 0.226 ± 0.069
0.697TrpIle: 0.697 ± 0.153
0.8TrpLys: 0.8 ± 0.134
1.374TrpLeu: 1.374 ± 0.181
0.533TrpMet: 0.533 ± 0.113
0.718TrpAsn: 0.718 ± 0.108
0.41TrpPro: 0.41 ± 0.1
0.349TrpGln: 0.349 ± 0.089
0.882TrpArg: 0.882 ± 0.146
0.636TrpSer: 0.636 ± 0.113
0.656TrpThr: 0.656 ± 0.114
1.148TrpVal: 1.148 ± 0.161
0.164TrpTrp: 0.164 ± 0.056
0.431TrpTyr: 0.431 ± 0.09
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.522TyrAla: 2.522 ± 0.213
0.533TyrCys: 0.533 ± 0.114
2.624TyrAsp: 2.624 ± 0.242
2.337TyrGlu: 2.337 ± 0.192
1.763TyrPhe: 1.763 ± 0.184
2.542TyrGly: 2.542 ± 0.236
1.005TyrHis: 1.005 ± 0.127
2.071TyrIle: 2.071 ± 0.249
2.153TyrLys: 2.153 ± 0.183
2.973TyrLeu: 2.973 ± 0.273
1.046TyrMet: 1.046 ± 0.124
2.399TyrAsn: 2.399 ± 0.19
1.661TyrPro: 1.661 ± 0.182
1.415TyrGln: 1.415 ± 0.176
2.071TyrArg: 2.071 ± 0.178
2.522TyrSer: 2.522 ± 0.216
2.112TyrThr: 2.112 ± 0.322
3.137TyrVal: 3.137 ± 0.244
0.451TyrTrp: 0.451 ± 0.105
1.435TyrTyr: 1.435 ± 0.157
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 210 proteins (48777 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski