Amino acid dipepetide frequency for Pseudoalteromonas phage J2-1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.553AlaAla: 4.553 ± 0.474
0.723AlaCys: 0.723 ± 0.132
4.023AlaAsp: 4.023 ± 0.319
4.143AlaGlu: 4.143 ± 0.407
2.746AlaPhe: 2.746 ± 0.224
3.927AlaGly: 3.927 ± 0.333
0.915AlaHis: 0.915 ± 0.131
4.649AlaIle: 4.649 ± 0.335
5.396AlaLys: 5.396 ± 0.563
6.36AlaLeu: 6.36 ± 0.407
2.313AlaMet: 2.313 ± 0.198
3.397AlaAsn: 3.397 ± 0.281
1.975AlaPro: 1.975 ± 0.234
2.867AlaGln: 2.867 ± 0.289
3.011AlaArg: 3.011 ± 0.249
3.83AlaSer: 3.83 ± 0.295
4.818AlaThr: 4.818 ± 0.353
4.168AlaVal: 4.168 ± 0.364
0.626AlaTrp: 0.626 ± 0.105
2.337AlaTyr: 2.337 ± 0.228
0.0AlaXaa: 0.0 ± 0.0
Cys
0.554CysAla: 0.554 ± 0.114
0.145CysCys: 0.145 ± 0.064
0.458CysAsp: 0.458 ± 0.11
0.795CysGlu: 0.795 ± 0.145
0.458CysPhe: 0.458 ± 0.082
0.964CysGly: 0.964 ± 0.168
0.337CysHis: 0.337 ± 0.107
0.554CysIle: 0.554 ± 0.096
0.626CysLys: 0.626 ± 0.108
0.626CysLeu: 0.626 ± 0.148
0.41CysMet: 0.41 ± 0.11
0.506CysAsn: 0.506 ± 0.097
0.458CysPro: 0.458 ± 0.098
0.434CysGln: 0.434 ± 0.097
0.506CysArg: 0.506 ± 0.114
0.795CysSer: 0.795 ± 0.139
0.699CysThr: 0.699 ± 0.142
0.867CysVal: 0.867 ± 0.145
0.145CysTrp: 0.145 ± 0.062
0.602CysTyr: 0.602 ± 0.112
0.0CysXaa: 0.0 ± 0.0
Asp
4.553AspAla: 4.553 ± 0.353
0.723AspCys: 0.723 ± 0.175
3.83AspAsp: 3.83 ± 0.311
4.457AspGlu: 4.457 ± 0.326
3.614AspPhe: 3.614 ± 0.288
5.685AspGly: 5.685 ± 0.494
1.229AspHis: 1.229 ± 0.184
4.818AspIle: 4.818 ± 0.307
4.071AspLys: 4.071 ± 0.338
6.263AspLeu: 6.263 ± 0.402
1.469AspMet: 1.469 ± 0.171
3.541AspAsn: 3.541 ± 0.25
2.168AspPro: 2.168 ± 0.27
2.024AspGln: 2.024 ± 0.293
2.505AspArg: 2.505 ± 0.222
3.541AspSer: 3.541 ± 0.307
3.999AspThr: 3.999 ± 0.386
3.927AspVal: 3.927 ± 0.362
1.132AspTrp: 1.132 ± 0.169
2.987AspTyr: 2.987 ± 0.284
0.0AspXaa: 0.0 ± 0.0
Glu
4.722GluAla: 4.722 ± 0.424
0.795GluCys: 0.795 ± 0.144
5.685GluAsp: 5.685 ± 0.384
6.914GluGlu: 6.914 ± 0.507
2.481GluPhe: 2.481 ± 0.233
4.722GluGly: 4.722 ± 0.317
1.445GluHis: 1.445 ± 0.194
4.384GluIle: 4.384 ± 0.294
5.107GluLys: 5.107 ± 0.415
7.01GluLeu: 7.01 ± 0.536
1.927GluMet: 1.927 ± 0.215
4.264GluAsn: 4.264 ± 0.36
1.518GluPro: 1.518 ± 0.198
3.132GluGln: 3.132 ± 0.302
2.77GluArg: 2.77 ± 0.295
3.493GluSer: 3.493 ± 0.286
3.83GluThr: 3.83 ± 0.288
4.963GluVal: 4.963 ± 0.254
1.325GluTrp: 1.325 ± 0.222
3.3GluTyr: 3.3 ± 0.254
0.0GluXaa: 0.0 ± 0.0
Phe
2.457PheAla: 2.457 ± 0.219
0.53PheCys: 0.53 ± 0.104
3.397PheAsp: 3.397 ± 0.281
3.349PheGlu: 3.349 ± 0.256
1.927PhePhe: 1.927 ± 0.235
3.445PheGly: 3.445 ± 0.278
0.891PheHis: 0.891 ± 0.154
2.578PheIle: 2.578 ± 0.236
2.891PheLys: 2.891 ± 0.257
3.324PheLeu: 3.324 ± 0.305
0.867PheMet: 0.867 ± 0.152
2.12PheAsn: 2.12 ± 0.248
1.253PhePro: 1.253 ± 0.157
1.108PheGln: 1.108 ± 0.158
1.831PheArg: 1.831 ± 0.187
3.156PheSer: 3.156 ± 0.278
2.819PheThr: 2.819 ± 0.303
3.3PheVal: 3.3 ± 0.311
0.771PheTrp: 0.771 ± 0.152
1.662PheTyr: 1.662 ± 0.198
0.0PheXaa: 0.0 ± 0.0
Gly
4.168GlyAla: 4.168 ± 0.325
0.699GlyCys: 0.699 ± 0.12
3.686GlyAsp: 3.686 ± 0.382
3.734GlyGlu: 3.734 ± 0.281
2.963GlyPhe: 2.963 ± 0.236
4.433GlyGly: 4.433 ± 0.417
1.036GlyHis: 1.036 ± 0.143
4.143GlyIle: 4.143 ± 0.341
5.155GlyLys: 5.155 ± 0.363
4.577GlyLeu: 4.577 ± 0.305
1.614GlyMet: 1.614 ± 0.187
3.662GlyAsn: 3.662 ± 0.328
1.012GlyPro: 1.012 ± 0.151
2.192GlyGln: 2.192 ± 0.2
2.698GlyArg: 2.698 ± 0.213
4.553GlySer: 4.553 ± 0.354
4.023GlyThr: 4.023 ± 0.553
4.481GlyVal: 4.481 ± 0.315
0.843GlyTrp: 0.843 ± 0.127
3.373GlyTyr: 3.373 ± 0.318
0.0GlyXaa: 0.0 ± 0.0
His
0.891HisAla: 0.891 ± 0.155
0.385HisCys: 0.385 ± 0.109
0.819HisAsp: 0.819 ± 0.132
0.843HisGlu: 0.843 ± 0.163
1.036HisPhe: 1.036 ± 0.162
0.867HisGly: 0.867 ± 0.153
0.337HisHis: 0.337 ± 0.097
1.132HisIle: 1.132 ± 0.142
1.927HisLys: 1.927 ± 0.242
1.783HisLeu: 1.783 ± 0.248
0.145HisMet: 0.145 ± 0.065
1.277HisAsn: 1.277 ± 0.191
1.012HisPro: 1.012 ± 0.142
0.265HisGln: 0.265 ± 0.084
0.94HisArg: 0.94 ± 0.173
0.988HisSer: 0.988 ± 0.163
1.469HisThr: 1.469 ± 0.166
0.747HisVal: 0.747 ± 0.129
0.217HisTrp: 0.217 ± 0.085
1.036HisTyr: 1.036 ± 0.153
0.0HisXaa: 0.0 ± 0.0
Ile
3.782IleAla: 3.782 ± 0.309
0.771IleCys: 0.771 ± 0.125
5.083IleAsp: 5.083 ± 0.352
4.963IleGlu: 4.963 ± 0.271
2.698IlePhe: 2.698 ± 0.3
3.421IleGly: 3.421 ± 0.307
1.156IleHis: 1.156 ± 0.138
3.156IleIle: 3.156 ± 0.271
4.264IleLys: 4.264 ± 0.284
4.794IleLeu: 4.794 ± 0.398
1.253IleMet: 1.253 ± 0.15
3.686IleAsn: 3.686 ± 0.284
2.698IlePro: 2.698 ± 0.239
1.951IleGln: 1.951 ± 0.249
2.505IleArg: 2.505 ± 0.229
3.951IleSer: 3.951 ± 0.279
4.192IleThr: 4.192 ± 0.338
4.216IleVal: 4.216 ± 0.309
0.385IleTrp: 0.385 ± 0.101
2.457IleTyr: 2.457 ± 0.224
0.0IleXaa: 0.0 ± 0.0
Lys
6.071LysAla: 6.071 ± 0.533
0.361LysCys: 0.361 ± 0.093
4.601LysAsp: 4.601 ± 0.369
6.408LysGlu: 6.408 ± 0.488
2.505LysPhe: 2.505 ± 0.267
4.457LysGly: 4.457 ± 0.297
1.734LysHis: 1.734 ± 0.198
3.614LysIle: 3.614 ± 0.306
4.987LysLys: 4.987 ± 0.471
6.047LysLeu: 6.047 ± 0.503
1.759LysMet: 1.759 ± 0.224
3.204LysAsn: 3.204 ± 0.285
2.674LysPro: 2.674 ± 0.248
3.059LysGln: 3.059 ± 0.256
3.011LysArg: 3.011 ± 0.295
3.927LysSer: 3.927 ± 0.301
4.673LysThr: 4.673 ± 0.292
5.131LysVal: 5.131 ± 0.342
0.482LysTrp: 0.482 ± 0.107
2.216LysTyr: 2.216 ± 0.252
0.0LysXaa: 0.0 ± 0.0
Leu
6.047LeuAla: 6.047 ± 0.352
1.156LeuCys: 1.156 ± 0.16
5.637LeuAsp: 5.637 ± 0.376
7.444LeuGlu: 7.444 ± 0.421
3.758LeuPhe: 3.758 ± 0.339
5.179LeuGly: 5.179 ± 0.357
0.964LeuHis: 0.964 ± 0.13
4.36LeuIle: 4.36 ± 0.323
5.589LeuLys: 5.589 ± 0.441
7.107LeuLeu: 7.107 ± 0.49
2.12LeuMet: 2.12 ± 0.238
3.758LeuAsn: 3.758 ± 0.282
3.349LeuPro: 3.349 ± 0.246
3.517LeuGln: 3.517 ± 0.269
3.71LeuArg: 3.71 ± 0.308
6.191LeuSer: 6.191 ± 0.399
5.324LeuThr: 5.324 ± 0.323
6.601LeuVal: 6.601 ± 0.374
0.867LeuTrp: 0.867 ± 0.137
2.963LeuTyr: 2.963 ± 0.244
0.0LeuXaa: 0.0 ± 0.0
Met
1.783MetAla: 1.783 ± 0.225
0.145MetCys: 0.145 ± 0.06
1.156MetAsp: 1.156 ± 0.177
1.734MetGlu: 1.734 ± 0.198
0.964MetPhe: 0.964 ± 0.149
1.518MetGly: 1.518 ± 0.194
0.289MetHis: 0.289 ± 0.08
1.566MetIle: 1.566 ± 0.211
1.831MetLys: 1.831 ± 0.214
1.831MetLeu: 1.831 ± 0.243
0.626MetMet: 0.626 ± 0.118
1.301MetAsn: 1.301 ± 0.186
0.94MetPro: 0.94 ± 0.148
1.084MetGln: 1.084 ± 0.159
1.036MetArg: 1.036 ± 0.161
2.024MetSer: 2.024 ± 0.232
1.759MetThr: 1.759 ± 0.216
1.879MetVal: 1.879 ± 0.208
0.217MetTrp: 0.217 ± 0.073
0.819MetTyr: 0.819 ± 0.129
0.0MetXaa: 0.0 ± 0.0
Asn
3.276AsnAla: 3.276 ± 0.258
0.41AsnCys: 0.41 ± 0.108
3.011AsnAsp: 3.011 ± 0.309
2.794AsnGlu: 2.794 ± 0.275
2.216AsnPhe: 2.216 ± 0.231
3.228AsnGly: 3.228 ± 0.361
1.084AsnHis: 1.084 ± 0.136
3.638AsnIle: 3.638 ± 0.308
3.83AsnLys: 3.83 ± 0.311
5.107AsnLeu: 5.107 ± 0.31
1.301AsnMet: 1.301 ± 0.186
3.397AsnAsn: 3.397 ± 0.365
2.289AsnPro: 2.289 ± 0.204
1.614AsnGln: 1.614 ± 0.212
2.096AsnArg: 2.096 ± 0.231
3.011AsnSer: 3.011 ± 0.274
3.903AsnThr: 3.903 ± 0.393
3.397AsnVal: 3.397 ± 0.285
0.843AsnTrp: 0.843 ± 0.123
2.385AsnTyr: 2.385 ± 0.231
0.0AsnXaa: 0.0 ± 0.0
Pro
1.59ProAla: 1.59 ± 0.22
0.385ProCys: 0.385 ± 0.107
3.469ProAsp: 3.469 ± 0.275
3.276ProGlu: 3.276 ± 0.302
1.421ProPhe: 1.421 ± 0.19
0.0ProGly: 0.0 ± 0.0
0.675ProHis: 0.675 ± 0.14
1.975ProIle: 1.975 ± 0.233
2.602ProLys: 2.602 ± 0.294
2.843ProLeu: 2.843 ± 0.288
0.94ProMet: 0.94 ± 0.122
1.662ProAsn: 1.662 ± 0.19
1.06ProPro: 1.06 ± 0.219
1.132ProGln: 1.132 ± 0.173
1.156ProArg: 1.156 ± 0.158
2.048ProSer: 2.048 ± 0.205
2.746ProThr: 2.746 ± 0.268
2.626ProVal: 2.626 ± 0.269
0.385ProTrp: 0.385 ± 0.087
1.349ProTyr: 1.349 ± 0.183
0.0ProXaa: 0.0 ± 0.0
Gln
3.084GlnAla: 3.084 ± 0.314
0.41GlnCys: 0.41 ± 0.096
2.168GlnAsp: 2.168 ± 0.204
3.204GlnGlu: 3.204 ± 0.326
1.469GlnPhe: 1.469 ± 0.221
1.831GlnGly: 1.831 ± 0.17
0.771GlnHis: 0.771 ± 0.124
2.457GlnIle: 2.457 ± 0.233
2.529GlnLys: 2.529 ± 0.26
3.276GlnLeu: 3.276 ± 0.243
1.253GlnMet: 1.253 ± 0.177
1.759GlnAsn: 1.759 ± 0.209
1.301GlnPro: 1.301 ± 0.228
1.855GlnGln: 1.855 ± 0.289
1.59GlnArg: 1.59 ± 0.206
1.373GlnSer: 1.373 ± 0.2
1.927GlnThr: 1.927 ± 0.182
2.578GlnVal: 2.578 ± 0.252
0.482GlnTrp: 0.482 ± 0.114
1.132GlnTyr: 1.132 ± 0.158
0.0GlnXaa: 0.0 ± 0.0
Arg
2.65ArgAla: 2.65 ± 0.25
0.361ArgCys: 0.361 ± 0.098
2.915ArgAsp: 2.915 ± 0.236
2.578ArgGlu: 2.578 ± 0.299
1.951ArgPhe: 1.951 ± 0.206
2.746ArgGly: 2.746 ± 0.261
0.699ArgHis: 0.699 ± 0.115
2.819ArgIle: 2.819 ± 0.285
2.819ArgLys: 2.819 ± 0.26
3.349ArgLeu: 3.349 ± 0.249
1.156ArgMet: 1.156 ± 0.19
1.855ArgAsn: 1.855 ± 0.254
1.397ArgPro: 1.397 ± 0.157
1.879ArgGln: 1.879 ± 0.213
1.662ArgArg: 1.662 ± 0.246
2.361ArgSer: 2.361 ± 0.192
2.385ArgThr: 2.385 ± 0.251
3.108ArgVal: 3.108 ± 0.292
0.578ArgTrp: 0.578 ± 0.121
1.807ArgTyr: 1.807 ± 0.193
0.0ArgXaa: 0.0 ± 0.0
Ser
3.927SerAla: 3.927 ± 0.298
0.747SerCys: 0.747 ± 0.137
4.505SerAsp: 4.505 ± 0.414
4.023SerGlu: 4.023 ± 0.356
3.156SerPhe: 3.156 ± 0.298
4.938SerGly: 4.938 ± 0.55
1.108SerHis: 1.108 ± 0.176
3.565SerIle: 3.565 ± 0.361
4.192SerLys: 4.192 ± 0.321
5.733SerLeu: 5.733 ± 0.333
1.494SerMet: 1.494 ± 0.183
3.373SerAsn: 3.373 ± 0.381
1.831SerPro: 1.831 ± 0.184
2.072SerGln: 2.072 ± 0.215
2.144SerArg: 2.144 ± 0.239
4.866SerSer: 4.866 ± 0.512
3.638SerThr: 3.638 ± 0.337
4.408SerVal: 4.408 ± 0.349
0.891SerTrp: 0.891 ± 0.136
2.867SerTyr: 2.867 ± 0.228
0.0SerXaa: 0.0 ± 0.0
Thr
4.264ThrAla: 4.264 ± 0.373
0.65ThrCys: 0.65 ± 0.133
3.999ThrAsp: 3.999 ± 0.421
4.168ThrGlu: 4.168 ± 0.345
3.059ThrPhe: 3.059 ± 0.236
3.903ThrGly: 3.903 ± 0.38
1.325ThrHis: 1.325 ± 0.167
4.722ThrIle: 4.722 ± 0.381
4.433ThrLys: 4.433 ± 0.317
5.396ThrLeu: 5.396 ± 0.391
0.988ThrMet: 0.988 ± 0.167
3.614ThrAsn: 3.614 ± 0.369
2.626ThrPro: 2.626 ± 0.242
2.77ThrGln: 2.77 ± 0.295
2.987ThrArg: 2.987 ± 0.274
4.143ThrSer: 4.143 ± 0.399
4.577ThrThr: 4.577 ± 0.49
4.625ThrVal: 4.625 ± 0.387
0.867ThrTrp: 0.867 ± 0.175
1.927ThrTyr: 1.927 ± 0.199
0.0ThrXaa: 0.0 ± 0.0
Val
5.035ValAla: 5.035 ± 0.348
0.843ValCys: 0.843 ± 0.125
4.433ValAsp: 4.433 ± 0.355
5.444ValGlu: 5.444 ± 0.336
2.819ValPhe: 2.819 ± 0.269
4.36ValGly: 4.36 ± 0.289
0.891ValHis: 0.891 ± 0.171
4.24ValIle: 4.24 ± 0.341
5.059ValLys: 5.059 ± 0.341
5.276ValLeu: 5.276 ± 0.331
1.71ValMet: 1.71 ± 0.217
3.662ValAsn: 3.662 ± 0.317
1.975ValPro: 1.975 ± 0.245
1.951ValGln: 1.951 ± 0.235
2.65ValArg: 2.65 ± 0.234
5.998ValSer: 5.998 ± 0.411
4.746ValThr: 4.746 ± 0.396
5.661ValVal: 5.661 ± 0.455
0.867ValTrp: 0.867 ± 0.116
2.578ValTyr: 2.578 ± 0.224
0.0ValXaa: 0.0 ± 0.0
Trp
0.843TrpAla: 0.843 ± 0.14
0.265TrpCys: 0.265 ± 0.076
0.964TrpAsp: 0.964 ± 0.168
1.036TrpGlu: 1.036 ± 0.166
0.747TrpPhe: 0.747 ± 0.128
0.747TrpGly: 0.747 ± 0.162
0.217TrpHis: 0.217 ± 0.059
0.675TrpIle: 0.675 ± 0.166
0.843TrpLys: 0.843 ± 0.161
1.132TrpLeu: 1.132 ± 0.172
0.265TrpMet: 0.265 ± 0.083
0.482TrpAsn: 0.482 ± 0.092
0.217TrpPro: 0.217 ± 0.09
0.217TrpGln: 0.217 ± 0.061
0.675TrpArg: 0.675 ± 0.125
0.867TrpSer: 0.867 ± 0.153
0.602TrpThr: 0.602 ± 0.129
1.012TrpVal: 1.012 ± 0.144
0.145TrpTrp: 0.145 ± 0.051
0.602TrpTyr: 0.602 ± 0.127
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.529TyrAla: 2.529 ± 0.259
0.434TyrCys: 0.434 ± 0.099
2.722TyrAsp: 2.722 ± 0.253
2.481TyrGlu: 2.481 ± 0.227
1.686TyrPhe: 1.686 ± 0.182
2.674TyrGly: 2.674 ± 0.252
1.012TyrHis: 1.012 ± 0.17
2.337TyrIle: 2.337 ± 0.194
2.77TyrLys: 2.77 ± 0.271
3.806TyrLeu: 3.806 ± 0.319
0.867TyrMet: 0.867 ± 0.136
2.313TyrAsn: 2.313 ± 0.221
1.59TyrPro: 1.59 ± 0.187
1.325TyrGln: 1.325 ± 0.168
1.59TyrArg: 1.59 ± 0.175
2.361TyrSer: 2.361 ± 0.26
2.915TyrThr: 2.915 ± 0.221
2.505TyrVal: 2.505 ± 0.266
0.482TyrTrp: 0.482 ± 0.103
1.325TyrTyr: 1.325 ± 0.165
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 180 proteins (41512 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski