Amino acid dipepetide frequency for Stenotrophomonas phage IME13

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.259AlaAla: 5.259 ± 0.428
0.556AlaCys: 0.556 ± 0.108
4.425AlaAsp: 4.425 ± 0.318
4.842AlaGlu: 4.842 ± 0.387
2.34AlaPhe: 2.34 ± 0.235
4.564AlaGly: 4.564 ± 0.416
1.413AlaHis: 1.413 ± 0.168
5.143AlaIle: 5.143 ± 0.321
5.467AlaLys: 5.467 ± 0.374
5.491AlaLeu: 5.491 ± 0.356
1.668AlaMet: 1.668 ± 0.181
4.031AlaAsn: 4.031 ± 0.306
2.085AlaPro: 2.085 ± 0.241
2.85AlaGln: 2.85 ± 0.326
3.313AlaArg: 3.313 ± 0.294
3.985AlaSer: 3.985 ± 0.333
3.938AlaThr: 3.938 ± 0.428
4.564AlaVal: 4.564 ± 0.32
1.066AlaTrp: 1.066 ± 0.156
2.734AlaTyr: 2.734 ± 0.266
0.0AlaXaa: 0.0 ± 0.0
Cys
0.811CysAla: 0.811 ± 0.137
0.232CysCys: 0.232 ± 0.074
0.88CysAsp: 0.88 ± 0.136
0.649CysGlu: 0.649 ± 0.125
0.487CysPhe: 0.487 ± 0.11
0.602CysGly: 0.602 ± 0.133
0.255CysHis: 0.255 ± 0.076
0.579CysIle: 0.579 ± 0.125
0.95CysLys: 0.95 ± 0.133
0.672CysLeu: 0.672 ± 0.114
0.232CysMet: 0.232 ± 0.069
0.51CysAsn: 0.51 ± 0.107
0.626CysPro: 0.626 ± 0.115
0.301CysGln: 0.301 ± 0.074
0.695CysArg: 0.695 ± 0.129
0.811CysSer: 0.811 ± 0.162
0.649CysThr: 0.649 ± 0.112
0.741CysVal: 0.741 ± 0.128
0.139CysTrp: 0.139 ± 0.051
0.44CysTyr: 0.44 ± 0.111
0.0CysXaa: 0.0 ± 0.0
Asp
3.985AspAla: 3.985 ± 0.272
0.579AspCys: 0.579 ± 0.102
4.193AspAsp: 4.193 ± 0.382
5.074AspGlu: 5.074 ± 0.34
3.568AspPhe: 3.568 ± 0.281
4.633AspGly: 4.633 ± 0.309
0.996AspHis: 0.996 ± 0.174
5.375AspIle: 5.375 ± 0.364
3.243AspLys: 3.243 ± 0.275
4.448AspLeu: 4.448 ± 0.329
2.178AspMet: 2.178 ± 0.258
2.78AspAsn: 2.78 ± 0.225
2.201AspPro: 2.201 ± 0.234
1.992AspGln: 1.992 ± 0.181
2.641AspArg: 2.641 ± 0.223
3.568AspSer: 3.568 ± 0.29
3.29AspThr: 3.29 ± 0.256
4.193AspVal: 4.193 ± 0.306
1.228AspTrp: 1.228 ± 0.217
2.664AspTyr: 2.664 ± 0.266
0.0AspXaa: 0.0 ± 0.0
Glu
4.587GluAla: 4.587 ± 0.368
0.834GluCys: 0.834 ± 0.148
3.197GluAsp: 3.197 ± 0.275
4.448GluGlu: 4.448 ± 0.401
3.406GluPhe: 3.406 ± 0.273
3.406GluGly: 3.406 ± 0.292
1.483GluHis: 1.483 ± 0.168
5.722GluIle: 5.722 ± 0.432
4.935GluLys: 4.935 ± 0.393
6.325GluLeu: 6.325 ± 0.396
2.409GluMet: 2.409 ± 0.254
4.008GluAsn: 4.008 ± 0.322
1.691GluPro: 1.691 ± 0.197
2.572GluGln: 2.572 ± 0.276
3.336GluArg: 3.336 ± 0.261
4.124GluSer: 4.124 ± 0.309
3.104GluThr: 3.104 ± 0.297
4.518GluVal: 4.518 ± 0.339
1.066GluTrp: 1.066 ± 0.175
3.128GluTyr: 3.128 ± 0.272
0.0GluXaa: 0.0 ± 0.0
Phe
3.151PheAla: 3.151 ± 0.32
0.579PheCys: 0.579 ± 0.111
3.707PheAsp: 3.707 ± 0.318
3.637PheGlu: 3.637 ± 0.289
1.344PhePhe: 1.344 ± 0.181
2.965PheGly: 2.965 ± 0.33
0.741PheHis: 0.741 ± 0.124
2.734PheIle: 2.734 ± 0.258
3.313PheLys: 3.313 ± 0.266
2.479PheLeu: 2.479 ± 0.288
1.135PheMet: 1.135 ± 0.218
2.641PheAsn: 2.641 ± 0.261
1.182PhePro: 1.182 ± 0.151
1.182PheGln: 1.182 ± 0.177
1.83PheArg: 1.83 ± 0.187
3.313PheSer: 3.313 ± 0.292
2.479PheThr: 2.479 ± 0.226
3.359PheVal: 3.359 ± 0.288
0.51PheTrp: 0.51 ± 0.115
1.344PheTyr: 1.344 ± 0.161
0.0PheXaa: 0.0 ± 0.0
Gly
3.915GlyAla: 3.915 ± 0.358
0.765GlyCys: 0.765 ± 0.131
4.054GlyAsp: 4.054 ± 0.391
3.846GlyGlu: 3.846 ± 0.316
2.386GlyPhe: 2.386 ± 0.275
3.684GlyGly: 3.684 ± 0.345
1.274GlyHis: 1.274 ± 0.199
4.494GlyIle: 4.494 ± 0.317
4.842GlyLys: 4.842 ± 0.352
4.587GlyLeu: 4.587 ± 0.349
1.83GlyMet: 1.83 ± 0.199
3.29GlyAsn: 3.29 ± 0.341
1.506GlyPro: 1.506 ± 0.162
2.386GlyGln: 2.386 ± 0.253
2.687GlyArg: 2.687 ± 0.233
4.286GlySer: 4.286 ± 0.389
3.985GlyThr: 3.985 ± 0.335
4.147GlyVal: 4.147 ± 0.311
0.857GlyTrp: 0.857 ± 0.153
3.151GlyTyr: 3.151 ± 0.238
0.0GlyXaa: 0.0 ± 0.0
His
1.089HisAla: 1.089 ± 0.157
0.348HisCys: 0.348 ± 0.085
1.599HisAsp: 1.599 ± 0.201
1.39HisGlu: 1.39 ± 0.174
1.043HisPhe: 1.043 ± 0.156
1.39HisGly: 1.39 ± 0.149
0.51HisHis: 0.51 ± 0.102
1.83HisIle: 1.83 ± 0.19
1.344HisLys: 1.344 ± 0.158
1.853HisLeu: 1.853 ± 0.261
0.371HisMet: 0.371 ± 0.089
0.904HisAsn: 0.904 ± 0.156
0.88HisPro: 0.88 ± 0.139
0.579HisGln: 0.579 ± 0.127
0.811HisArg: 0.811 ± 0.168
1.112HisSer: 1.112 ± 0.16
0.88HisThr: 0.88 ± 0.15
1.158HisVal: 1.158 ± 0.161
0.371HisTrp: 0.371 ± 0.107
0.695HisTyr: 0.695 ± 0.129
0.0HisXaa: 0.0 ± 0.0
Ile
5.352IleAla: 5.352 ± 0.382
0.996IleCys: 0.996 ± 0.148
5.328IleAsp: 5.328 ± 0.345
4.494IleGlu: 4.494 ± 0.297
2.757IlePhe: 2.757 ± 0.217
3.962IleGly: 3.962 ± 0.319
1.019IleHis: 1.019 ± 0.17
4.24IleIle: 4.24 ± 0.331
5.004IleLys: 5.004 ± 0.316
4.402IleLeu: 4.402 ± 0.382
1.784IleMet: 1.784 ± 0.198
4.216IleAsn: 4.216 ± 0.32
2.919IlePro: 2.919 ± 0.278
2.456IleGln: 2.456 ± 0.204
3.707IleArg: 3.707 ± 0.327
4.842IleSer: 4.842 ± 0.394
4.842IleThr: 4.842 ± 0.342
4.796IleVal: 4.796 ± 0.326
0.626IleTrp: 0.626 ± 0.086
2.456IleTyr: 2.456 ± 0.251
0.0IleXaa: 0.0 ± 0.0
Lys
5.606LysAla: 5.606 ± 0.427
0.602LysCys: 0.602 ± 0.124
3.568LysAsp: 3.568 ± 0.305
4.471LysGlu: 4.471 ± 0.396
3.29LysPhe: 3.29 ± 0.273
3.73LysGly: 3.73 ± 0.308
1.83LysHis: 1.83 ± 0.241
4.749LysIle: 4.749 ± 0.315
4.17LysLys: 4.17 ± 0.42
5.398LysLeu: 5.398 ± 0.36
2.386LysMet: 2.386 ± 0.235
4.124LysAsn: 4.124 ± 0.316
2.085LysPro: 2.085 ± 0.212
2.572LysGln: 2.572 ± 0.248
3.382LysArg: 3.382 ± 0.348
4.865LysSer: 4.865 ± 0.328
4.402LysThr: 4.402 ± 0.369
3.892LysVal: 3.892 ± 0.322
0.927LysTrp: 0.927 ± 0.153
2.896LysTyr: 2.896 ± 0.246
0.0LysXaa: 0.0 ± 0.0
Leu
5.444LeuAla: 5.444 ± 0.398
0.765LeuCys: 0.765 ± 0.13
4.541LeuAsp: 4.541 ± 0.327
4.888LeuGlu: 4.888 ± 0.401
3.104LeuPhe: 3.104 ± 0.28
4.564LeuGly: 4.564 ± 0.289
1.112LeuHis: 1.112 ± 0.14
4.24LeuIle: 4.24 ± 0.294
5.884LeuLys: 5.884 ± 0.371
4.587LeuLeu: 4.587 ± 0.374
2.479LeuMet: 2.479 ± 0.182
4.448LeuAsn: 4.448 ± 0.291
3.267LeuPro: 3.267 ± 0.262
2.595LeuGln: 2.595 ± 0.231
3.938LeuArg: 3.938 ± 0.294
5.213LeuSer: 5.213 ± 0.355
3.591LeuThr: 3.591 ± 0.297
4.749LeuVal: 4.749 ± 0.273
0.533LeuTrp: 0.533 ± 0.106
2.989LeuTyr: 2.989 ± 0.289
0.0LeuXaa: 0.0 ± 0.0
Met
2.27MetAla: 2.27 ± 0.192
0.232MetCys: 0.232 ± 0.079
1.321MetAsp: 1.321 ± 0.177
1.622MetGlu: 1.622 ± 0.21
1.668MetPhe: 1.668 ± 0.169
1.83MetGly: 1.83 ± 0.214
0.765MetHis: 0.765 ± 0.127
2.016MetIle: 2.016 ± 0.236
2.34MetLys: 2.34 ± 0.239
2.618MetLeu: 2.618 ± 0.247
0.741MetMet: 0.741 ± 0.137
1.483MetAsn: 1.483 ± 0.189
1.158MetPro: 1.158 ± 0.152
1.135MetGln: 1.135 ± 0.174
1.483MetArg: 1.483 ± 0.2
2.548MetSer: 2.548 ± 0.221
1.877MetThr: 1.877 ± 0.19
1.297MetVal: 1.297 ± 0.151
0.301MetTrp: 0.301 ± 0.089
1.043MetTyr: 1.043 ± 0.141
0.0MetXaa: 0.0 ± 0.0
Asn
3.66AsnAla: 3.66 ± 0.278
0.394AsnCys: 0.394 ± 0.104
3.012AsnAsp: 3.012 ± 0.287
3.614AsnGlu: 3.614 ± 0.294
2.734AsnPhe: 2.734 ± 0.241
4.147AsnGly: 4.147 ± 0.338
1.251AsnHis: 1.251 ± 0.173
3.776AsnIle: 3.776 ± 0.306
3.568AsnLys: 3.568 ± 0.287
3.962AsnLeu: 3.962 ± 0.322
1.691AsnMet: 1.691 ± 0.221
3.359AsnAsn: 3.359 ± 0.331
3.035AsnPro: 3.035 ± 0.295
2.016AsnGln: 2.016 ± 0.223
2.711AsnArg: 2.711 ± 0.262
3.938AsnSer: 3.938 ± 0.263
3.197AsnThr: 3.197 ± 0.345
3.452AsnVal: 3.452 ± 0.272
0.788AsnTrp: 0.788 ± 0.143
2.108AsnTyr: 2.108 ± 0.279
0.0AsnXaa: 0.0 ± 0.0
Pro
2.456ProAla: 2.456 ± 0.253
0.324ProCys: 0.324 ± 0.085
2.409ProAsp: 2.409 ± 0.26
3.128ProGlu: 3.128 ± 0.279
1.436ProPhe: 1.436 ± 0.14
2.479ProGly: 2.479 ± 0.248
0.718ProHis: 0.718 ± 0.121
2.456ProIle: 2.456 ± 0.218
2.317ProLys: 2.317 ± 0.246
2.178ProLeu: 2.178 ± 0.226
1.066ProMet: 1.066 ± 0.154
2.224ProAsn: 2.224 ± 0.225
0.904ProPro: 0.904 ± 0.163
0.834ProGln: 0.834 ± 0.123
1.529ProArg: 1.529 ± 0.195
2.803ProSer: 2.803 ± 0.267
1.969ProThr: 1.969 ± 0.212
2.409ProVal: 2.409 ± 0.29
0.626ProTrp: 0.626 ± 0.107
1.946ProTyr: 1.946 ± 0.224
0.0ProXaa: 0.0 ± 0.0
Gln
2.803GlnAla: 2.803 ± 0.234
0.348GlnCys: 0.348 ± 0.093
1.39GlnAsp: 1.39 ± 0.216
1.714GlnGlu: 1.714 ± 0.22
1.529GlnPhe: 1.529 ± 0.167
1.529GlnGly: 1.529 ± 0.196
0.718GlnHis: 0.718 ± 0.127
2.595GlnIle: 2.595 ± 0.251
2.386GlnLys: 2.386 ± 0.239
3.382GlnLeu: 3.382 ± 0.256
1.251GlnMet: 1.251 ± 0.157
1.923GlnAsn: 1.923 ± 0.195
1.089GlnPro: 1.089 ± 0.174
1.297GlnGln: 1.297 ± 0.194
1.39GlnArg: 1.39 ± 0.189
2.062GlnSer: 2.062 ± 0.192
2.363GlnThr: 2.363 ± 0.285
2.456GlnVal: 2.456 ± 0.247
0.417GlnTrp: 0.417 ± 0.103
1.83GlnTyr: 1.83 ± 0.195
0.0GlnXaa: 0.0 ± 0.0
Arg
3.267ArgAla: 3.267 ± 0.304
0.95ArgCys: 0.95 ± 0.149
3.104ArgAsp: 3.104 ± 0.234
3.498ArgGlu: 3.498 ± 0.299
2.016ArgPhe: 2.016 ± 0.232
2.826ArgGly: 2.826 ± 0.257
1.205ArgHis: 1.205 ± 0.169
3.545ArgIle: 3.545 ± 0.23
3.22ArgLys: 3.22 ± 0.369
3.614ArgLeu: 3.614 ± 0.324
1.506ArgMet: 1.506 ± 0.191
2.896ArgAsn: 2.896 ± 0.267
1.251ArgPro: 1.251 ± 0.156
2.131ArgGln: 2.131 ± 0.229
2.201ArgArg: 2.201 ± 0.263
2.618ArgSer: 2.618 ± 0.264
2.479ArgThr: 2.479 ± 0.27
3.22ArgVal: 3.22 ± 0.3
0.996ArgTrp: 0.996 ± 0.144
2.062ArgTyr: 2.062 ± 0.186
0.0ArgXaa: 0.0 ± 0.0
Ser
4.494SerAla: 4.494 ± 0.335
0.695SerCys: 0.695 ± 0.144
4.17SerAsp: 4.17 ± 0.29
4.24SerGlu: 4.24 ± 0.321
2.826SerPhe: 2.826 ± 0.251
4.703SerGly: 4.703 ± 0.348
1.436SerHis: 1.436 ± 0.182
5.143SerIle: 5.143 ± 0.338
4.124SerLys: 4.124 ± 0.287
4.355SerLeu: 4.355 ± 0.277
1.413SerMet: 1.413 ± 0.182
3.452SerAsn: 3.452 ± 0.313
2.456SerPro: 2.456 ± 0.232
2.108SerGln: 2.108 ± 0.226
3.707SerArg: 3.707 ± 0.226
4.564SerSer: 4.564 ± 0.352
3.799SerThr: 3.799 ± 0.422
4.633SerVal: 4.633 ± 0.343
0.788SerTrp: 0.788 ± 0.138
2.734SerTyr: 2.734 ± 0.257
0.0SerXaa: 0.0 ± 0.0
Thr
3.776ThrAla: 3.776 ± 0.444
0.533ThrCys: 0.533 ± 0.117
3.243ThrAsp: 3.243 ± 0.313
3.753ThrGlu: 3.753 ± 0.336
2.525ThrPhe: 2.525 ± 0.237
3.962ThrGly: 3.962 ± 0.336
0.973ThrHis: 0.973 ± 0.138
3.869ThrIle: 3.869 ± 0.296
3.429ThrLys: 3.429 ± 0.274
4.425ThrLeu: 4.425 ± 0.294
1.691ThrMet: 1.691 ± 0.167
3.128ThrAsn: 3.128 ± 0.292
3.058ThrPro: 3.058 ± 0.255
1.877ThrGln: 1.877 ± 0.188
2.965ThrArg: 2.965 ± 0.315
4.286ThrSer: 4.286 ± 0.425
3.012ThrThr: 3.012 ± 0.289
4.402ThrVal: 4.402 ± 0.379
0.834ThrTrp: 0.834 ± 0.139
1.83ThrTyr: 1.83 ± 0.236
0.0ThrXaa: 0.0 ± 0.0
Val
3.799ValAla: 3.799 ± 0.291
0.857ValCys: 0.857 ± 0.162
4.564ValAsp: 4.564 ± 0.328
5.213ValGlu: 5.213 ± 0.363
2.803ValPhe: 2.803 ± 0.309
4.077ValGly: 4.077 ± 0.339
1.251ValHis: 1.251 ± 0.176
4.633ValIle: 4.633 ± 0.371
5.166ValLys: 5.166 ± 0.381
4.355ValLeu: 4.355 ± 0.312
1.877ValMet: 1.877 ± 0.205
3.892ValAsn: 3.892 ± 0.33
2.525ValPro: 2.525 ± 0.266
1.691ValGln: 1.691 ± 0.235
3.197ValArg: 3.197 ± 0.239
4.101ValSer: 4.101 ± 0.247
3.985ValThr: 3.985 ± 0.371
4.564ValVal: 4.564 ± 0.309
0.788ValTrp: 0.788 ± 0.143
3.058ValTyr: 3.058 ± 0.281
0.0ValXaa: 0.0 ± 0.0
Trp
0.996TrpAla: 0.996 ± 0.141
0.232TrpCys: 0.232 ± 0.069
1.043TrpAsp: 1.043 ± 0.15
0.996TrpGlu: 0.996 ± 0.152
0.672TrpPhe: 0.672 ± 0.107
0.51TrpGly: 0.51 ± 0.101
0.278TrpHis: 0.278 ± 0.087
0.811TrpIle: 0.811 ± 0.126
1.112TrpLys: 1.112 ± 0.149
0.95TrpLeu: 0.95 ± 0.139
0.672TrpMet: 0.672 ± 0.116
0.556TrpAsn: 0.556 ± 0.125
0.487TrpPro: 0.487 ± 0.11
0.579TrpGln: 0.579 ± 0.11
0.649TrpArg: 0.649 ± 0.107
0.765TrpSer: 0.765 ± 0.141
0.626TrpThr: 0.626 ± 0.117
0.973TrpVal: 0.973 ± 0.137
0.185TrpTrp: 0.185 ± 0.065
0.649TrpTyr: 0.649 ± 0.116
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.012TyrAla: 3.012 ± 0.308
0.44TyrCys: 0.44 ± 0.114
3.081TyrAsp: 3.081 ± 0.296
2.78TyrGlu: 2.78 ± 0.242
1.691TyrPhe: 1.691 ± 0.181
2.409TyrGly: 2.409 ± 0.269
0.927TyrHis: 0.927 ± 0.155
2.247TyrIle: 2.247 ± 0.236
2.131TyrLys: 2.131 ± 0.224
2.85TyrLeu: 2.85 ± 0.266
1.344TyrMet: 1.344 ± 0.174
2.572TyrAsn: 2.572 ± 0.242
1.807TyrPro: 1.807 ± 0.185
1.321TyrGln: 1.321 ± 0.175
2.433TyrArg: 2.433 ± 0.224
2.039TyrSer: 2.039 ± 0.225
3.104TyrThr: 3.104 ± 0.242
2.873TyrVal: 2.873 ± 0.252
0.695TyrTrp: 0.695 ± 0.149
1.321TyrTyr: 1.321 ± 0.256
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 178 proteins (43166 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski