Amino acid dipepetide frequency for Aeromonas phage CF8

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.672AlaAla: 3.672 ± 0.391
0.53AlaCys: 0.53 ± 0.096
3.278AlaAsp: 3.278 ± 0.203
3.645AlaGlu: 3.645 ± 0.24
2.339AlaPhe: 2.339 ± 0.205
4.406AlaGly: 4.406 ± 0.29
1.115AlaHis: 1.115 ± 0.111
4.366AlaIle: 4.366 ± 0.255
4.284AlaLys: 4.284 ± 0.309
5.277AlaLeu: 5.277 ± 0.308
1.618AlaMet: 1.618 ± 0.143
3.169AlaAsn: 3.169 ± 0.233
1.999AlaPro: 1.999 ± 0.176
2.679AlaGln: 2.679 ± 0.206
2.625AlaArg: 2.625 ± 0.23
3.658AlaSer: 3.658 ± 0.255
3.033AlaThr: 3.033 ± 0.217
4.257AlaVal: 4.257 ± 0.252
0.694AlaTrp: 0.694 ± 0.1
1.958AlaTyr: 1.958 ± 0.172
0.0AlaXaa: 0.0 ± 0.0
Cys
0.503CysAla: 0.503 ± 0.096
0.177CysCys: 0.177 ± 0.05
0.68CysAsp: 0.68 ± 0.093
0.598CysGlu: 0.598 ± 0.09
0.68CysPhe: 0.68 ± 0.108
0.544CysGly: 0.544 ± 0.095
0.19CysHis: 0.19 ± 0.054
0.53CysIle: 0.53 ± 0.087
0.857CysLys: 0.857 ± 0.13
0.734CysLeu: 0.734 ± 0.102
0.231CysMet: 0.231 ± 0.051
0.394CysAsn: 0.394 ± 0.076
0.422CysPro: 0.422 ± 0.068
0.408CysGln: 0.408 ± 0.074
0.626CysArg: 0.626 ± 0.079
0.558CysSer: 0.558 ± 0.086
0.449CysThr: 0.449 ± 0.07
0.694CysVal: 0.694 ± 0.099
0.177CysTrp: 0.177 ± 0.053
0.53CysTyr: 0.53 ± 0.074
0.0CysXaa: 0.0 ± 0.0
Asp
3.427AspAla: 3.427 ± 0.234
0.612AspCys: 0.612 ± 0.086
3.4AspAsp: 3.4 ± 0.212
3.767AspGlu: 3.767 ± 0.246
3.06AspPhe: 3.06 ± 0.24
3.822AspGly: 3.822 ± 0.254
1.387AspHis: 1.387 ± 0.137
4.42AspIle: 4.42 ± 0.258
4.175AspLys: 4.175 ± 0.249
6.066AspLeu: 6.066 ± 0.296
1.727AspMet: 1.727 ± 0.143
3.563AspAsn: 3.563 ± 0.173
3.482AspPro: 3.482 ± 0.29
2.516AspGln: 2.516 ± 0.168
2.366AspArg: 2.366 ± 0.161
3.21AspSer: 3.21 ± 0.228
3.414AspThr: 3.414 ± 0.231
3.876AspVal: 3.876 ± 0.205
0.911AspTrp: 0.911 ± 0.116
2.938AspTyr: 2.938 ± 0.203
0.0AspXaa: 0.0 ± 0.0
Glu
3.305GluAla: 3.305 ± 0.245
0.49GluCys: 0.49 ± 0.073
3.318GluAsp: 3.318 ± 0.229
4.678GluGlu: 4.678 ± 0.32
3.414GluPhe: 3.414 ± 0.199
3.604GluGly: 3.604 ± 0.233
1.7GluHis: 1.7 ± 0.173
4.57GluIle: 4.57 ± 0.255
3.794GluLys: 3.794 ± 0.265
6.705GluLeu: 6.705 ± 0.354
1.904GluMet: 1.904 ± 0.158
3.346GluAsn: 3.346 ± 0.183
1.741GluPro: 1.741 ± 0.179
3.427GluGln: 3.427 ± 0.233
3.128GluArg: 3.128 ± 0.223
4.012GluSer: 4.012 ± 0.256
4.311GluThr: 4.311 ± 0.267
4.352GluVal: 4.352 ± 0.266
0.68GluTrp: 0.68 ± 0.102
2.638GluTyr: 2.638 ± 0.179
0.0GluXaa: 0.0 ± 0.0
Phe
2.366PheAla: 2.366 ± 0.183
0.626PheCys: 0.626 ± 0.098
3.536PheAsp: 3.536 ± 0.219
2.611PheGlu: 2.611 ± 0.2
2.135PhePhe: 2.135 ± 0.164
3.604PheGly: 3.604 ± 0.211
0.857PheHis: 0.857 ± 0.123
3.114PheIle: 3.114 ± 0.216
3.754PheLys: 3.754 ± 0.219
3.25PheLeu: 3.25 ± 0.231
1.442PheMet: 1.442 ± 0.169
3.645PheAsn: 3.645 ± 0.248
1.89PhePro: 1.89 ± 0.183
1.414PheGln: 1.414 ± 0.132
1.673PheArg: 1.673 ± 0.153
3.495PheSer: 3.495 ± 0.195
2.638PheThr: 2.638 ± 0.165
3.087PheVal: 3.087 ± 0.174
0.49PheTrp: 0.49 ± 0.083
2.067PheTyr: 2.067 ± 0.194
0.0PheXaa: 0.0 ± 0.0
Gly
3.074GlyAla: 3.074 ± 0.24
0.762GlyCys: 0.762 ± 0.104
3.876GlyAsp: 3.876 ± 0.273
4.23GlyGlu: 4.23 ± 0.253
3.468GlyPhe: 3.468 ± 0.202
4.42GlyGly: 4.42 ± 0.37
1.17GlyHis: 1.17 ± 0.141
4.706GlyIle: 4.706 ± 0.235
4.814GlyLys: 4.814 ± 0.279
5.984GlyLeu: 5.984 ± 0.343
2.04GlyMet: 2.04 ± 0.165
3.618GlyAsn: 3.618 ± 0.27
1.482GlyPro: 1.482 ± 0.125
2.149GlyGln: 2.149 ± 0.178
3.006GlyArg: 3.006 ± 0.268
4.624GlySer: 4.624 ± 0.303
3.917GlyThr: 3.917 ± 0.252
4.298GlyVal: 4.298 ± 0.277
1.074GlyTrp: 1.074 ± 0.119
2.965GlyTyr: 2.965 ± 0.201
0.0GlyXaa: 0.0 ± 0.0
His
1.02HisAla: 1.02 ± 0.118
0.258HisCys: 0.258 ± 0.063
1.142HisAsp: 1.142 ± 0.139
1.197HisGlu: 1.197 ± 0.15
1.115HisPhe: 1.115 ± 0.12
1.129HisGly: 1.129 ± 0.127
0.394HisHis: 0.394 ± 0.077
1.251HisIle: 1.251 ± 0.151
1.265HisLys: 1.265 ± 0.128
1.918HisLeu: 1.918 ± 0.178
0.503HisMet: 0.503 ± 0.087
0.938HisAsn: 0.938 ± 0.113
1.142HisPro: 1.142 ± 0.132
0.843HisGln: 0.843 ± 0.102
1.142HisArg: 1.142 ± 0.125
1.074HisSer: 1.074 ± 0.099
0.952HisThr: 0.952 ± 0.108
1.319HisVal: 1.319 ± 0.146
0.245HisTrp: 0.245 ± 0.059
0.87HisTyr: 0.87 ± 0.123
0.0HisXaa: 0.0 ± 0.0
Ile
4.597IleAla: 4.597 ± 0.247
0.598IleCys: 0.598 ± 0.106
5.059IleAsp: 5.059 ± 0.27
4.379IleGlu: 4.379 ± 0.219
2.434IlePhe: 2.434 ± 0.213
3.862IleGly: 3.862 ± 0.28
1.292IleHis: 1.292 ± 0.118
3.509IleIle: 3.509 ± 0.263
5.195IleLys: 5.195 ± 0.307
4.434IleLeu: 4.434 ± 0.303
1.346IleMet: 1.346 ± 0.131
4.107IleAsn: 4.107 ± 0.247
3.291IlePro: 3.291 ± 0.274
2.217IleGln: 2.217 ± 0.177
3.4IleArg: 3.4 ± 0.209
4.529IleSer: 4.529 ± 0.268
4.202IleThr: 4.202 ± 0.245
3.808IleVal: 3.808 ± 0.244
0.666IleTrp: 0.666 ± 0.11
2.475IleTyr: 2.475 ± 0.15
0.0IleXaa: 0.0 ± 0.0
Lys
4.638LysAla: 4.638 ± 0.309
0.462LysCys: 0.462 ± 0.077
4.134LysAsp: 4.134 ± 0.256
6.079LysGlu: 6.079 ± 0.391
2.761LysPhe: 2.761 ± 0.19
3.849LysGly: 3.849 ± 0.247
1.578LysHis: 1.578 ± 0.178
3.618LysIle: 3.618 ± 0.229
3.332LysLys: 3.332 ± 0.229
6.773LysLeu: 6.773 ± 0.369
1.714LysMet: 1.714 ± 0.152
2.747LysAsn: 2.747 ± 0.179
2.543LysPro: 2.543 ± 0.176
3.101LysGln: 3.101 ± 0.229
2.611LysArg: 2.611 ± 0.176
3.604LysSer: 3.604 ± 0.216
4.148LysThr: 4.148 ± 0.189
4.638LysVal: 4.638 ± 0.287
0.694LysTrp: 0.694 ± 0.104
2.584LysTyr: 2.584 ± 0.229
0.0LysXaa: 0.0 ± 0.0
Leu
5.413LeuAla: 5.413 ± 0.314
0.993LeuCys: 0.993 ± 0.133
6.12LeuAsp: 6.12 ± 0.261
5.685LeuGlu: 5.685 ± 0.283
3.998LeuPhe: 3.998 ± 0.231
5.766LeuGly: 5.766 ± 0.302
1.387LeuHis: 1.387 ± 0.139
5.712LeuIle: 5.712 ± 0.298
5.902LeuLys: 5.902 ± 0.347
7.29LeuLeu: 7.29 ± 0.349
2.584LeuMet: 2.584 ± 0.196
5.508LeuAsn: 5.508 ± 0.221
3.305LeuPro: 3.305 ± 0.213
2.965LeuGln: 2.965 ± 0.19
4.121LeuArg: 4.121 ± 0.219
6.256LeuSer: 6.256 ± 0.374
5.943LeuThr: 5.943 ± 0.352
6.406LeuVal: 6.406 ± 0.339
0.775LeuTrp: 0.775 ± 0.123
3.264LeuTyr: 3.264 ± 0.211
0.0LeuXaa: 0.0 ± 0.0
Met
1.795MetAla: 1.795 ± 0.187
0.122MetCys: 0.122 ± 0.047
1.686MetAsp: 1.686 ± 0.182
1.85MetGlu: 1.85 ± 0.165
1.36MetPhe: 1.36 ± 0.117
1.727MetGly: 1.727 ± 0.155
0.34MetHis: 0.34 ± 0.065
1.564MetIle: 1.564 ± 0.142
1.618MetLys: 1.618 ± 0.138
2.339MetLeu: 2.339 ± 0.182
0.789MetMet: 0.789 ± 0.11
1.89MetAsn: 1.89 ± 0.155
0.857MetPro: 0.857 ± 0.102
0.911MetGln: 0.911 ± 0.106
0.938MetArg: 0.938 ± 0.123
2.067MetSer: 2.067 ± 0.163
1.428MetThr: 1.428 ± 0.165
1.89MetVal: 1.89 ± 0.148
0.258MetTrp: 0.258 ± 0.059
1.02MetTyr: 1.02 ± 0.124
0.0MetXaa: 0.0 ± 0.0
Asn
3.386AsnAla: 3.386 ± 0.193
0.449AsnCys: 0.449 ± 0.098
2.679AsnAsp: 2.679 ± 0.203
3.495AsnGlu: 3.495 ± 0.248
2.326AsnPhe: 2.326 ± 0.183
4.352AsnGly: 4.352 ± 0.348
1.061AsnHis: 1.061 ± 0.115
3.563AsnIle: 3.563 ± 0.229
3.876AsnLys: 3.876 ± 0.242
4.692AsnLeu: 4.692 ± 0.257
1.224AsnMet: 1.224 ± 0.127
3.754AsnAsn: 3.754 ± 0.282
3.427AsnPro: 3.427 ± 0.208
2.149AsnGln: 2.149 ± 0.198
2.666AsnArg: 2.666 ± 0.213
3.155AsnSer: 3.155 ± 0.257
3.414AsnThr: 3.414 ± 0.202
3.74AsnVal: 3.74 ± 0.278
0.83AsnTrp: 0.83 ± 0.131
2.122AsnTyr: 2.122 ± 0.155
0.0AsnXaa: 0.0 ± 0.0
Pro
2.326ProAla: 2.326 ± 0.182
0.326ProCys: 0.326 ± 0.064
2.842ProAsp: 2.842 ± 0.226
3.21ProGlu: 3.21 ± 0.192
1.836ProPhe: 1.836 ± 0.178
2.638ProGly: 2.638 ± 0.197
0.748ProHis: 0.748 ± 0.098
2.938ProIle: 2.938 ± 0.234
2.135ProLys: 2.135 ± 0.159
2.938ProLeu: 2.938 ± 0.222
0.83ProMet: 0.83 ± 0.101
2.312ProAsn: 2.312 ± 0.173
1.129ProPro: 1.129 ± 0.112
1.591ProGln: 1.591 ± 0.148
1.55ProArg: 1.55 ± 0.153
2.598ProSer: 2.598 ± 0.18
2.611ProThr: 2.611 ± 0.193
3.427ProVal: 3.427 ± 0.247
0.462ProTrp: 0.462 ± 0.076
1.387ProTyr: 1.387 ± 0.136
0.0ProXaa: 0.0 ± 0.0
Gln
2.761GlnAla: 2.761 ± 0.19
0.381GlnCys: 0.381 ± 0.066
1.958GlnAsp: 1.958 ± 0.164
2.57GlnGlu: 2.57 ± 0.227
2.285GlnPhe: 2.285 ± 0.18
2.298GlnGly: 2.298 ± 0.163
0.762GlnHis: 0.762 ± 0.09
2.761GlnIle: 2.761 ± 0.188
1.782GlnLys: 1.782 ± 0.17
4.066GlnLeu: 4.066 ± 0.28
1.142GlnMet: 1.142 ± 0.11
1.7GlnAsn: 1.7 ± 0.218
1.387GlnPro: 1.387 ± 0.126
1.659GlnGln: 1.659 ± 0.152
1.809GlnArg: 1.809 ± 0.159
2.149GlnSer: 2.149 ± 0.163
2.394GlnThr: 2.394 ± 0.188
2.87GlnVal: 2.87 ± 0.228
0.449GlnTrp: 0.449 ± 0.073
1.618GlnTyr: 1.618 ± 0.144
0.0GlnXaa: 0.0 ± 0.0
Arg
2.734ArgAla: 2.734 ± 0.215
0.503ArgCys: 0.503 ± 0.093
3.046ArgAsp: 3.046 ± 0.232
2.693ArgGlu: 2.693 ± 0.208
2.434ArgPhe: 2.434 ± 0.149
2.638ArgGly: 2.638 ± 0.24
0.884ArgHis: 0.884 ± 0.11
2.965ArgIle: 2.965 ± 0.228
2.666ArgLys: 2.666 ± 0.184
4.325ArgLeu: 4.325 ± 0.262
1.251ArgMet: 1.251 ± 0.135
2.258ArgAsn: 2.258 ± 0.15
1.387ArgPro: 1.387 ± 0.141
1.55ArgGln: 1.55 ± 0.146
1.958ArgArg: 1.958 ± 0.16
2.625ArgSer: 2.625 ± 0.205
2.761ArgThr: 2.761 ± 0.203
3.196ArgVal: 3.196 ± 0.192
0.544ArgTrp: 0.544 ± 0.081
2.135ArgTyr: 2.135 ± 0.168
0.0ArgXaa: 0.0 ± 0.0
Ser
3.645SerAla: 3.645 ± 0.217
0.598SerCys: 0.598 ± 0.106
3.699SerAsp: 3.699 ± 0.191
3.699SerGlu: 3.699 ± 0.215
3.536SerPhe: 3.536 ± 0.297
4.597SerGly: 4.597 ± 0.392
1.074SerHis: 1.074 ± 0.116
4.366SerIle: 4.366 ± 0.217
4.121SerLys: 4.121 ± 0.245
6.147SerLeu: 6.147 ± 0.314
1.578SerMet: 1.578 ± 0.111
3.019SerAsn: 3.019 ± 0.21
2.516SerPro: 2.516 ± 0.159
2.122SerGln: 2.122 ± 0.156
2.53SerArg: 2.53 ± 0.175
3.971SerSer: 3.971 ± 0.346
3.781SerThr: 3.781 ± 0.254
4.651SerVal: 4.651 ± 0.209
0.707SerTrp: 0.707 ± 0.084
2.638SerTyr: 2.638 ± 0.225
0.0SerXaa: 0.0 ± 0.0
Thr
3.686ThrAla: 3.686 ± 0.219
0.585ThrCys: 0.585 ± 0.098
3.468ThrAsp: 3.468 ± 0.194
3.468ThrGlu: 3.468 ± 0.222
2.87ThrPhe: 2.87 ± 0.216
4.678ThrGly: 4.678 ± 0.296
1.455ThrHis: 1.455 ± 0.143
3.822ThrIle: 3.822 ± 0.25
3.482ThrLys: 3.482 ± 0.238
5.658ThrLeu: 5.658 ± 0.242
1.659ThrMet: 1.659 ± 0.13
2.87ThrAsn: 2.87 ± 0.223
3.06ThrPro: 3.06 ± 0.172
2.23ThrGln: 2.23 ± 0.161
2.666ThrArg: 2.666 ± 0.172
3.781ThrSer: 3.781 ± 0.205
3.386ThrThr: 3.386 ± 0.234
4.515ThrVal: 4.515 ± 0.228
0.626ThrTrp: 0.626 ± 0.092
2.23ThrTyr: 2.23 ± 0.205
0.0ThrXaa: 0.0 ± 0.0
Val
3.618ValAla: 3.618 ± 0.254
0.925ValCys: 0.925 ± 0.114
4.76ValAsp: 4.76 ± 0.238
4.284ValGlu: 4.284 ± 0.268
2.938ValPhe: 2.938 ± 0.22
4.311ValGly: 4.311 ± 0.243
1.251ValHis: 1.251 ± 0.152
4.597ValIle: 4.597 ± 0.265
5.29ValLys: 5.29 ± 0.285
5.794ValLeu: 5.794 ± 0.268
1.646ValMet: 1.646 ± 0.135
4.515ValAsn: 4.515 ± 0.252
2.638ValPro: 2.638 ± 0.182
2.448ValGln: 2.448 ± 0.219
3.019ValArg: 3.019 ± 0.203
4.461ValSer: 4.461 ± 0.221
4.284ValThr: 4.284 ± 0.26
4.801ValVal: 4.801 ± 0.285
0.734ValTrp: 0.734 ± 0.115
2.938ValTyr: 2.938 ± 0.236
0.0ValXaa: 0.0 ± 0.0
Trp
0.435TrpAla: 0.435 ± 0.078
0.204TrpCys: 0.204 ± 0.048
0.925TrpAsp: 0.925 ± 0.121
0.857TrpGlu: 0.857 ± 0.119
0.639TrpPhe: 0.639 ± 0.105
0.762TrpGly: 0.762 ± 0.091
0.231TrpHis: 0.231 ± 0.059
0.843TrpIle: 0.843 ± 0.114
0.734TrpLys: 0.734 ± 0.094
1.142TrpLeu: 1.142 ± 0.125
0.272TrpMet: 0.272 ± 0.063
0.734TrpAsn: 0.734 ± 0.101
0.245TrpPro: 0.245 ± 0.057
0.394TrpGln: 0.394 ± 0.074
0.517TrpArg: 0.517 ± 0.079
0.68TrpSer: 0.68 ± 0.087
0.598TrpThr: 0.598 ± 0.091
0.87TrpVal: 0.87 ± 0.099
0.163TrpTrp: 0.163 ± 0.048
0.462TrpTyr: 0.462 ± 0.074
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.203TyrAla: 2.203 ± 0.161
0.381TyrCys: 0.381 ± 0.059
2.53TyrAsp: 2.53 ± 0.184
2.122TyrGlu: 2.122 ± 0.171
2.067TyrPhe: 2.067 ± 0.166
2.666TyrGly: 2.666 ± 0.194
0.843TyrHis: 0.843 ± 0.092
2.19TyrIle: 2.19 ± 0.141
2.53TyrLys: 2.53 ± 0.211
3.862TyrLeu: 3.862 ± 0.23
0.938TyrMet: 0.938 ± 0.117
2.271TyrAsn: 2.271 ± 0.187
1.904TyrPro: 1.904 ± 0.156
2.026TyrGln: 2.026 ± 0.174
2.203TyrArg: 2.203 ± 0.194
2.489TyrSer: 2.489 ± 0.21
2.53TyrThr: 2.53 ± 0.18
2.557TyrVal: 2.557 ± 0.193
0.53TyrTrp: 0.53 ± 0.073
1.686TyrTyr: 1.686 ± 0.167
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 244 proteins (73531 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski